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(54) NOVEL Beta 1 -4 N-ACETYLGLUCOSAMIN YLTR ANSFERASE AND GENE ENCODING THE 
SAME 

(57) The present invention relates to a novel 
enzyme having pi -> 4 N-acetylglucosaminyftrans- 
f erase (GnT-IV) activity; a gene encoding the enzyme; a 
recombinant DNA comprising the gene; a host cell com- 
prising the recombinant DNA; a method for producing 
an enzyme protein having GnT-IV activity comprising 
culturtng the host cell in a medium; and a saccharide in 
which the sugar chain is modified using a GnT-IV. 

According to the present invention, a novel GnT-iV t 
a method for producing the enzyme and a gene coding 
for the enzyme are provided. With the GnT-IV of the 
present invention, it has become possible to produce a 
saccharide having a branching structure which could 
not be formed with conventional, glycosyltransferases. 
Thus, the GnT-IV of the invention is useful not only for 
producing or improving glycoconjugate type pharma- 
ceuticals, reagents and foods, but also for modifying the 
sugar chain structure of any biopolymer. 



EP0 905 232 A1 



Description 

Technical Field 

[0001 ] The present invention relates to a novel N-acetylglucosaminyltransferase (GlcNAc transferase) which recog- 
nizes a specific sugar chain structure in a saccharide and introduces thereinto a GlcNAc p1 -> 4 branching structure. 

Background Art 
1. Glycoproteins 

[0002] Most of proteins occurring in nature are not simple proteins composed of amino acids alone, but "mature" pro- 
teins having sugar chains and other substances such as phosphates and lipids attaching thereto. Therefore, the devel- 
opment of simple protein-type products produced by Escherichia colt as a host has involved various problems because 
such products lack the maturing process of proteins. Since all of secretion-type physiologically active proteins (eg. 
cytokines) are glycoproteins with a few exceptions, the function and the role of sugar chains have attracted attention as 
the most important point in the development of biological pharmaceuticals. 

[0003] Sugar chains in glycoproteins are classified roughly into Asn-linked type, mucin-type. O-finked GlcNAc type. 
GPI anchor type and proteoglycan type [Makoto Takeuchi, "Glycobiology Series 5: Gycotechnology". Kihata, Hakomori 
and Nagai (eds.), Kodansha Scientific Co., (1994), 191-208]. Each of these types of sugar chains has its own biosyn- 
thesis pathway and a discrete physiological function. Asn-linked sugar chains are distributed widely in molds, yeasts, 
insects, plants and animals. The basic biosynthesis pathway for Asn-linked sugar chains is conserved beyond species 
(Rg. 1). A sugar chain(s) characteristic of a specific species is(are) formed on the outer side (called the "non-reducing 
terminal side") of the core sugar chain moiety which is common in the biosynthesis of Asn-linked sugar chains. A man- 
nan-type sugar chain in which a1,3- and a1 ,2-branching mannose residues attach to a main chain extending via a1 ,6 
linkages is a sugar chain structure characteristic of fungi such as yeasts (see Panel a in Fig. 2) [Hiroshi Nakajima. 
Sugar Chain Technology. Industry Survey Association (1992). 384-397]. On the other hand, in insects, plants and ani- 
mals, extension of mannose residues is not observed; instead, a high mannose type sugar chain is formed which is a 
sugar chain transferred from a dolichol intermediate and only trimmed (see Panel c in Rg. 2). A unique structure having 
characteristic xylose or the like (see Panel b in Rg. 2) is also observed in insects, plants and mollusks. In animals, char- 
acteristic sugar chain structures such as complex type sugar chain (Panel e in Rg. 2) and hybrid type sugar chain 
(Panel d in Rg. 2) are observed; in the former, GlcNAc branching structures are formed in a once trimmed sugar chain, 
and addition of other kinds of monosaccharides such as galactose and sialic acid forms complicated structures; in the 
latter, both a complex type sugar chain and a high mannose type sugar chain are present [Kiyoshi Furukawa. Sugar 
Chain Technology, Industry Survey Association (1992), 64-75]. 

[0004] Such sugar chains as described above are conferred on most of cell surface proteins and secretion proteins, 
and are thought to play important roles which determine the natures and properties of cells and proteins. Among all. 
the portion of a sugar chain structure which forms a branch elongating like antennas from the common core sugar chain 
is called a sugar chain branching structure. This structure is believed to have a function to give an organism recognition 
ligand (i.e.. the end portion of the sugar chain) a high degree of freedom to thereby provide chances for multipoint rec- 
ognition and another function to maximize the protection ability for the protein moiety by greatly increasing the space- 
occupying volume (Takeuchi et al.. supra). Therefore, by controlling the branching structure of sugar chains, it is possi- 
ble to modify the physiological functions, sush as in vivo stability, in vivo kinetics and organ-targeting properties of glyc- 
oproteins in various ways. In view of this, technology to control branching structures of sugar chains is expected as 
biotechnology of the next generation for the development of glycoprotein-type pharmaceuticals which are "tender to 
humans". 

2. Physiological Significance of Glycoprotein Sugar Chains 

[0005] Sugar chains of secretion type glycoproteins exhibit excellent functions in biosynthesis, intracellular sorting, 
masking of antigenicity, in vivo stability and organ-targeting properties of glycoproteins. Sugar chains of cell surface 
proteins are known to change in response to changes in cells (such as differentiation, change to a morbid state, can- 
ceration). In particular, it has been reported that there is a close relation between the metastasis of cancer and the 
branching structure of sugar chains. 

(1) Masking of Antigenicity 

[0006] It is considered that sugar chains have a high degree of freedom in terms of steric structure and thus are mov- 
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ing freely like propellers. Therefore, protein molecules such as proteases and anttoodies against proteins not having 
affinity to sugar chains are shook off by the sugar chains and thus cannot gain access to the protein moiety. As a result, 
even if there is antigenicity in the peptide moiety near the sugar chain binding site, antibody molecules cannot have 
access to the peptide moiety. Thus, an antigen-antibody reaction is extremely difficult to occur Further, when a glyco- 
protein has been captured by a macrophage and the degradation products are presented as antigen, receptors are dif- 
ficult of access to the peptides around the sugar chain binding site. Thus, antigenic stimulation is difficult to occur. 
Actually, it is reported that when sugar chains have been introduced into the central portion of the antigenic peptide of 
ovalbumin lysozyme, the binding of MHC class II molecules to the antigen is remarkably inhibited [Mouritsen. S., 
MekJal, M., Christiansen-Brams. L. Eisner. H. and Werdelin, a. Eur. J. Immunol., (1994). 24, 1066-1072]. The effect of 
such masking of antigenicity becomes greater as the volume occupied by sugar chains is greater. Thus, it is considered 
that the development of a branching structure contributes to the effect of such masking greatly. 

(2) In Vivo Stability 

[0007] With respect to erythropoietin which is the first glycoprotein-type pharmaceutical ever produced from a trans- 
genic animal cell as a host, the functions of sugar chains thereof have been studied thoroughly. As a result, it has been 
shown that the sugar chains of erythropoietin work inhibitorily against the binding of erythropoietin with its receptor but 
make a decisive contribution to the retaining of the active structure and the improvement of in vivo kinetics; as a whole, 
the sugar chains have been shown to be essential for expression of the pharmacological activity of erythropoietin 
(Takeuchi, M. and Kbbata. A., Glycobiology (1991). 1, 337-346]. In particular, a strong correlation between the number 
of antennae in sugar chains and the pharmacological effect of erythropoietin has been found, and thus the importance 
of its branching structure (a branching structure formed by GlcNAc residues attaching to the core sugar chain) which 
never attracted attention has been made clear for the first time [Takeuchi. M., Inoue. N., Strickland. T W. f Kbbata. M., 
Wada. M.. Shimizu, R., Hoshi, S.. Kbzutsumi, H., TakasaW. a and Kobata, A.. Proc. Natl. Acad. Sci. USA, (1989), 86, 
7819-22]. The major cause of the above phenomenon is explained as follows: erythropoietin without developed branch- 
ing structure is cleared rather rapidly in kidney and, as a result the in vivo residence time of such erythropoietin 
becomes shorter [Misaizu, T, Matsuki. S., Strickland, T. W, Takeuchi, M.. Kobata, A. and TakasaW, S. ( Blood, (1995), 
86, 4097-4104]. 

(3) Organ Targeting Property 

[0008] Most of biological tissues have lectin-like receptors and use then in cell-cell interactions or to uptake glycopro- 
teins from blood. The asialoprotein-bjnding lectin in liver is a representative example of a clearance system for aged 
glycoproteins pbshihiro Kawasaki, Sugar Chain Technology. Industry Survey Association (1992). 125-136]. In addition, 
selectin contained in vascular endothelial cells, platelets and leucocytes (Kawasaki, supra) and the lectin receptor 
present on the surface of macrophages and NK cells (Kawasaki, supra) are well known. Furthermore, not only glyco- 
proteins but also cells are known to gather in a specific tissue using sugar chains as ligands. Cases of the homing of 
bone marrow cells [Tatsuo Irimura, "Glycobiology Series 3: Glycobiology in Cell Society", Katsutaka Nagai, Senichiro 
Hakomori and AWra Kobata (eds.), Kbdansha Scientific Co., (1993), 127-175] and the recruiting of neutrophils to 
inflammatory sites (Irimura. supra) are examined in detail. Putting all these things together, it can be well assumed that 
glycoproteins and cells have, via their sugar chain structures, a targeting property toward specific organs or tissues pre- 
senting a lectin receptor in blood circulation, although such a targeting system is not found in all organ. This means that 
drug delivery by means of sugar chains is possible. In such drug delivery, the affinity of lectin for sugar chains is greatly 
influenced by the degree of freedom and the number of sugar chain ligands. Therefore, the branching structure of sugar 
chains will be the most important point in such drug delivery 

(4) Correlation between Cells* Change into Morbid State and Sugar Chain Branched Structure thereof [Junko Kato, 
Naoko Suzuki. " Sugar Chain Technology and Development of Pharmaceutical". Foundation for the Relief and Study of 
Injury Caused by Pharmaceutical Side Effect (ed.), Yakugyo-Jiho-Sha, (1994), 107-13214] 

[0009] Once a plant lectin called L-PHA was developed as a probe to detect a multi-branching type sugar chain struc- 
ture, it has become possible to examine various morbid tissue samples. As a result, a tendency has been found that 
some types of cancer cells, particularly, cancer cells with a high metastasis ability are stained well with L-PHA. Thus, 
researchers have become aware of the correlation between the branching structure of sugar chains and the metastasis 
ability of cancer cells. Human chorionic gonadotropin (hCG) is a glycoprotein hormone vigorously biosynthesized in vil- 
lus tissues at an early stage of pregnancy. Since a considerable amount of hCG is discharged into urine. hCG is clini- 
cally utilized as an indicator of pregnancy. The Asn-linked sugar chains mainly formed by the mono- and biantennary 
conplex type chains are characteristic to hCG. As cancer increases its malignancy from trophoblastoma to invasive 
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mole and from invasive mole to choriocarcinoma, it is reported that 2,4,2 type tri-antennary sugar chains and abnormal 
biantennary sugar chains (both are formed through the action of GnT-IV on normal biantennary and mono-antennary 
sugar chains, respectively) appear in the sugar chains of hCG [Katsuko Yamashita, Protein, Nucleic Acid and Enzyme 
(1992), 37, 1880-1888). As a cause of this phenomenon, it is suggested that the activity of GnT-IV increases as the 
malignancy of choriocarcinoma progresses. 

[001 0] y-Glutamyltranspeptidase (y-GTP) is a glycoprotein occurring specifically abundant in liver. Since the serum y- 
GTP level drastically increases when there is a liver disease, this level is used as clinical indicator of a liver disease- 
Further, Yamashita et al. [Yamashita. K., Totani, K., Iwaki, Y. Takamisawa, I., Takeishi. N., Higashi, T. f Sakamoto, Y and 
Kobata. A., J. Biochem.. (1989). 105, 728-735] have found that, as a result of canceration of cells, the sugar chain struc- 
ture of y-GTP changes abnormally in its branching structure similar to those in abnormal hCG; thus, they have reported 
the correlation between canceration and the activation of GnT-IV. The Asn-linked sugar chains of y-GTP derived from 
healthy human liver cells are composed mainly of the biantennary complex type sugar chain with small amounts of tri- 
antennary and tetra-antennary sugar chains mixed therein. In contrast, a remarkable increase in the degree of branch- 
ing was observed in the Asn-linked sugar chains of y-G TP derived from human hepatoma cells. At the same time, 
though small in amounts, high mannose type sugar chains and abnormal biantennary sugar chains (both of which were 
not observed in y-GTP from normal ceils) appeared. As a cause of these changes in sugar chain structure, a possibility 
is suggested that N-acetylglucosaminyltransferase IV (GnT-IV) and V (GnT-V) are activated in relation to canceration of 
liver cells (Yamashita et al.. supra). 

[001 1 ] It is also reported that the sugar chain branching structure of a glycoprotein in cells is greatly changed by viral 
infection (Yamashita et al, supra). BHK cells have sugar chain structures with branching up to tetraantennary type. 
When BHK cells are transformed with polyomavirus, biantennary type sugar chains decrease in the glycoprotein sugar 
chains produced by the cells, while tetraantennary type sugar chains and the N-acetyllactosamine repeat structures 
increase; as a whole, a remarkable increase in the number of branches was recognized [Takasaki S.. Ikehira, H. and 
Kobata A., Biochem. Biophys. Res. Commun., (1980). 90, (3), 735-742]. As a cause of the above change, activation of 
GnT-IV, GnT-V and i-GnT may be considered. 

3. Enzymes relating to the Sugar Chain Branching Structures of Glycoproteins 

[0012] The complex type sugar chain which is a glycoprotein sugar chain structure characteristic of animals has a 
complicated branching structure in which N-acetylglucosamine (GlcNAc) residues are attaching to the common core 
structure in various manners (Kiyoshi Furukawa, supra) (Fig. 1). Since this branching structure is closely related to in 
vivo and in vivo stability, localization, biological activity and pharmacological property of glycoproteins (Makoto 
Takeuchi, supra), the process of biosynthesis of the branching structure has been investigated in detail. By using inven- 
tive substrates H. Schachter et al. have discriminated the various enzyme activities in hen oviduct to thereby predict the 
presence of GlcNAc branch forming enzymes from GnT-l to GnT-VI (group of GlcNAc glycosyttransferases; Fig. 3) 
[Glesson, P. A. and Schachter, H.. J. Biol. Chera, (1983), 258, 6162-6173]. Thereafter, GnT-l [Kumar, R„ Yang, J., 
Larsen. R. D. and Stanley P. Proc. Natl. Acad. Sci. USA, (1990). 87. 9948-9952; Sarkar. M.. Hull. E.. Nishikawa. Y, 
Simpson. R. J., Moritz, R. L, Dunn, R. and Schachter, H., Proc. Natl. Acad. Sci., USA, (1991), 88, 234-238], GnT-il 
[D'Agostaro. GA., Zingoni, A., Moritz, RL, Simpson. FU, Schachter, H. and Bendiak, B., J. Biol. Chem, (1995), 270, 
15211-21], GnT-lll [Nishikawa. A., ihara. Y, Hatakeyama. M., Kangawa. K. and Taniguchi, N., J. Biol. Chera, (1992). 
267, 18199-18204] and GnT-V [Shorebah. M. G., Hindsgaul, O. and Pierce, M., J. Biol. Chem.. (1992), 267. 2920-2927; 
Gu. J., Nishikawa, A., Turuoka, N., Ono, M.. Ykmaguchi, N. t Kangawa, K. and Taniguchi, N., J. Biochem., (1993), 113. 
614-619] were successively purified, and the genes thereof were cloned. However, with these known GlcNAc trans- 
ferases alone, it is impossible to form the main sugar chain (tetraantennary type; see the formula below) found in a1 
acid glycoprotein known as a representative human blood glycoprotein [Yoshima. K., Tsuji. T., Irimura, T. and Osawa. 
T, J. Biol. Chem.. (1984), 256, 10834-10840] and erythropoietin (Takeuchi, M.. Takasaki, S.. Shimada, M. and Kobata, 
A., J. Biol. Chem., (1990), 265, 12127-12130]. Therefore, an N-acetylglucosaminyltransferase having such substrate 
specificity and reaction specificity that are expected in GnT-IV has been searched for as a missing link. 



GlcNAc 01^* 

GlcNAc 01^2 p l ^4GlcNAc 8 i-4GlcNAc- 

GlcNAc 01^ 4x „ / 
GlcNAc 2 

Tetraantennary type sugar chain structure 



10 
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[0013] In addition to those mentioned above, the following N-acetykjIucosaminyltransferases have been purified or 
the genes thereof have been cloned: a transferase acting on mucin-type sugar chains [Bierhuizen, M. R, Maemura, K. 
and Rukuda. M., J. Biol Chem., (1994). 269, 4473-4479]. a transferase acting on gtycolipids, and a transferase forming 
the sugar chain epitope known as I • i antigenic structure [Kawashima, H., Yamamoto, K., Osawa, T. and Irimura, T., J. 
Biol. Chem., (1993). 268, 27118-27126; Bierhuizen, M. R, Mattel, M. G. and Fukuda, M.. Genes Dev., (1993). 7, 468- 
478]. However, the substrate specificity of these transferases and the mode of binding of the GlcNAc group transferred 
by these transferases are different from those of GnT-IV. Any of these transferases does not yield products resembling 
GnT-IV products. 

Disclosure of the Invention 



[0014] It is an object of the present invention to provide an enzyme having pi -> 4 W-acetylglucosaminyltransferase 
(hereinafter referred to as "GnT-IV) activity; a gene encoding the enzyme; a recombinant DNA comprising the gene; a 

is cell containing the recombinant DNA; a method for producing an enzyme protein having GnT-IV activity comprising cul- 
turing the cell in a medium; and a saccharide in which the sugar chains are modified with GnT-IV. 
[0015] Toward the solution of the above assignments, the present inventors have made intensive and extensive 
researches. As a result, the inventors have isolated and purified a GnT-IV enzyme protein from bovine small intestine, 
and characterized the biochemical properties of the protein. Then, the inventors have succeeded in cloning a gene cod- 

20 ing for bovine GnT-IVa from a cDNA library and mRNA from the small intestine based on a partial amino acid sequence 
of the above enzyme protein. Further, based on bovine GnT-IVa gene, the inventors have succeeded in cloning two 
genes coding for human GnT-IVa and human GnT-IVb, from cDNA libraries and mRNAs from human liver and human 
lung, respectively. The present invention has been completed by confirming that the products of these genes exhibit 
GnT-IV activity. 

25 [001 6] The first invention of the present application relates to a GnT-IV having an activity to produce a saccharide hav- 
ing a partial structure represented by the formula below: 



GlcNAc (31^4 , ^Man/?l-4GlcNAc- 



35 using UDP-GlcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below as 
a sugar receptor: 

40 ^3 **** P I— tGlcNAo 



45 [0017J The second invention relates to a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 18 or 
the amino acid sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of one or more amino 
acid residues and yet which produces GnT-IV activity; a GnT-IV consisting of the amino acid sequence shown in SEQ 
ID NO: 24 or the amino acid sequence shown in SEQ ID NO: 24 which has addition, deletion or substitution of one or 
more amino acid residues and yet which produces GnT-IV activity; and a GnT-IV consisting of the amino acid sequence 

50 shown in SEQ ID NO: 37 or the amino acid sequence shown in SEQ ID NO: 37 which has addition, deletion or substi- 
tution of one or more amino acid residues and yet which produces GnT-IV activity. 

[0018] The third invention relates to a GnT-IV gene coding for a GnT-IV consisting of the amino add sequence shown 
in SEQ ID NO: 18 or the amino add sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of 
one or more amino add residues and yet which produces GnT-IV activity; a GnT-IV gene coding for a GnT-IV consisting 
55 of the amino acid sequence shown in SEQ ID NO: 24 or the amino acid sequence shown in SEQ ID NO: 24 which has 
addition, deletion or substitution of one or more amino acfcJ residues and yet which produces GnT-IV activity; a GnT-IV 
gene coding for a GnT-IV consisting of the amino acid sequence shown in SEQ ID NO: 37 or the amino acid sequence 
shown in SEQ ID NO: 37 which has addition, deletion or substitution of one or more amino acid residues and yet which 
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produces GnT-IV activity; a GnT-IV gene consisting of the nucleotide sequence shown in SEQ ID NO: 17; a GnT-IV 
gene consisting of the nucleotide sequence shown in SEQ ID NO: 23; and a GnT-IV gene consisting of the nucleotide 
sequence shown in SEQ ID NO: 36. 

1001 9] The fourth invention relates to a recombinant DNA obtainable by inserting any of the above GnT-IV gene into 
5 a vector DNA;. and a chromosomal fragment comprising a part or all of any one of the above GnT-IV gene. 

[0020] The fifth invention relates to a host cell carrying the above recombinant DNA; and a host ceil into which the 
above chromosomal fragment is artificially introduced. 

[0021 J The sixth invention relates to a method for producing a GnT-IV comprising culturing the above host cell in a 
medium and recovering the GnT-IV from the resultant culture; and a method for producing a GnT-IV comprising recov- 
io ering the GnT-IV enzyme from the secreta. body fluids or homogenete originated from the above host cell. 
[0022] The seventh invention relates to a method for purifying the GnT-IV from biological samples. 
[0023] The eighth invention relates to a saccharide of which the sugar chain structure is modified with the GnT-IV. 
[0024] Hereinbelow, the present invention will be described in detail. 
[0025] The GnT-IV gene of the invention can be isolated as described below. 

15 

Isolation of Bovine GnT-PVa Gene 

[0026] First, a microsome fraction from bovine small intestine solubifized with a detergent is subjected to a series of 
purification procedures using anion exchange chromatography, copper chelate chromatography; two-step affinity chro- 
20 matography using a substrate analogue and gel filtration to thereby obtain a purified sample of GnT-IV enzyme. The 
resultant purified sample is subjected to SDS-PAGE and then transferred onto a PVDF membrane. The transferred pro- 
tein, as it is or after restricted hydrolysis, is analyzed with a gas phase amino acid sequencer to obtain a partial amino 
acid sequence for the GnT-IV enzyme. 

[0027] Subsequently, an RT-PCR is performed on the RNA extracted from the animal cells (i.e., bovine small intestine) 
25 as a template using primers designed based on the partial amino acid sequences determined above. Further, using a 
fragment obtained by the RT-PCR as a probe, the GnT-IV gene of interest is screened from a cDNA Itorary from the 
above-mentioned tissue by plaque hybridization. A cDNA fragment contained in the resultant positive plaque is cut out 
and subcloned into a vector such as pUC19, followed by analysis of the nucleotide sequence thereof. If the full length 
of the gene coding for the protein of interest is not contained in the fragment, plaque hybridization is performed again 
30 using a part of the subcloned cDNA fragment as a probe. Alternatively, terminal portions of the cDNA of interest are 
obtained by RACE or the like based on the information one the nucleotide sequence obtained above. The thus obtained 
GnT-IV gene (which is named GnT-IVa aferward) is subjected to analysis of its entire nucleotide sequence. Subse- 
quently, the amino acid sequence is translated from the gene having the above-mentioned nucleotide sequence. This 
amino acid sequence is as shown in SEQ ID NO: 18. 

35 

Isolation of Human GnT-IVa and GnT-IVb Genes 

[0028] Human GnT-IVa and GnT-IVb genes can be obtained by performing a RT-PCR using RNA extracted from a 
human tissue (liver or lung) and based on the information on the nucleotide sequence of bovine GnT-IVa gene as 

ao obtained above, followed by screening of a cDNA library from the above tissue. The resultant human GnT-IVa and GnT- 
IVb genes are subjected to analysis of their entire nucleotide sequences. Subsequently, the amino acid sequences are 
translated by these genes. These amino acid sequences are as shewn in SEQ ID NOS: 24 and 37. 
[0029] In order to obtain a DNA coding for the amino acid sequence shown in SEQ ID NO: 18, 24 or 37 having addi- 
tion, deletion or substitution of one or more amino acid residues, a number of methods may be used. For example, a 

45 method of treating DNA with a mutagen to induce point mutation or a deletion mutation; a method comprising cleaving 
DNA selectively, removing or adding a selected nucleotide and then ligating DNA; site-specific mutagenesis; and the 
like may be enumerated. 

[0030] The GnT-IV protein of the invention can be produced by preparing a recombinant vector into which a DNA cod- 
ing for the GnT-IV of the invention obtained by the method described above is inserted downstream of a promoter, intro- 

so ducing the vector into a host ceil and culturing the resultant cell. The vector DNA used for this purpose may be either 
plasmid DNA or bacteriophage DNA. For example, pSVL vector (Pharmacia, Sweden) shown in an Example described 
later may be used. As the host cell into which the resultant recombinant DNA is introduced, any cell that is convention- 
ally used in recombinant DNA techniques may be used, for example, a prokaryotic cell, an animal cell, a yeart, a fungi, 
an insect cell Specific examples include Escherichia coli as a prokaryotic cell and CHO cells from Chinese hamster or 

55 COS cells from monkey as an animal cell. 

[0031] The transformation of the host cell described above is performed by conventional methods for each host. For 
example, if the host is E. coli, a vector comprising the recombinant DNA is introduced by the heat shock method or elec- 
troporation into competent cells prepared by the calcium method or the like. If the host is yeast, a vector comprising the 
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recombinant DNA is introduced by the heat shock method or electroporation into competent cells prepared by the lith- 
ium method or the like. If the host is an animal cell, a vector comprising the recombinant DNA is introduced into the cell 
at the growth phase or the like by the calcium phosphate method, lipofection or electroporation. 
[0032] By culturing the thus obtained transformant in a medium, the GnT-IV protein is produced. 
[0033] In the cultivation of a transformant. any medium may be used as long as the host is viable in it For example, 
LB medium or the like may be used "rf the host is E. coll If the host is yeast. YPD medium or the like may be used. If the 
host is an animal cell. Dulbecco's medium supplemented with an animal serum or the like may be used. The cultivation 
is performed under conditions conventionally used for the host For example, if the host is E. coli, cells are cultured at 
about 30-37°C for about 3-24 hours with, H necessary, aeration and/or agitation. If the host is yeast, cells are cultured 
at about 25-37°C for about 1 2 hours to 2 weeks with, if necessary, aeration and/or agitation. If the host is an animal cell, 
cultivation is performed at about 32-37°C under 5% 0O 2 and 100% humidity for abort 24 hours to 2 weeks with, if nec- 
essary, change of the aeration conditions and/or agitation. 

[0034] After the cultivation, the cultured microorganism or cells are disrupted using a homogenizer, French press, son- 
ication. lysozyme and/or freeze-thawing to thereby elute the GnT-IV protein outside the microorganism or cells. Then, 
the protein can be obtained from soluble fractions. If the protein of interest is contained in insoluble fractions, the insol- 
uble fractions are collected by centrifugation after disruption of the microorganism or cells. Then, the protein may be 
solubflized with a buffer containing guanidine hydrochloride or the like for recovery. Alternatively, the cultured microor- 
ganism or cells may be disrupted directly with a buffer containing a protein denaturing agent such as guanidine hydro- 
chloride to thereby elute the protein of interest outside the microorganism or cells. 

[0035] Purification of the GnT-IV protein from the above supernatant may be performed by the method described in 
Example 1. Alternatively, this purification may be performed by appropriately combining conventional separation/purifi- 
cation methods. These conventional separation/purification methods include, but are not limited to. centrifugation, salt- 
ing out. solvent precipitation, dialysis, ultrafiltration, partition chromatography, gel filtration, capillary electrophoresis. 
TLC, ion exchange chromatography, metal chelate chromatography, affinity chromatography, reversed phase chroma- 
tography and isoelectric focusing. 

[0036] The biochemical properties of the GnT-IV enzyme protein obtained from bovine small intestine as described 
above are as follows. 

(1) Action 

[0037] This enzyme protein produces a saccharide having a partial structure represented by the formula below: 



GlcN Ac Bl^ A ^,?Man p l-4GlcNAc- 



using UDP-GIcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below as 
a sugar receptor: 



3 Man p I— 4GlcNAc- 



[0038] The saccharide as a sugar receptor means an oligosaccharide, polysaccharide, glycoconjugate (glycopeptide, 
glycoprotein, glycolipid or proteoglycan) or a derivative thereof. 

(2) Substrate Specificity 

[0039] When the sugar receptor is an oligosaccharide (for the structures of oligosaccharides, see Fig. 4), the enzyme 
protein exhibits reactivities of 0% toward core type oligosaccharides. 54% toward GnT-l product type oligosaccharides 
and 164% toward GnT-V product type oligosaccharides, wherein the reactivity of the enzyme protein toward GnT-ll 
product type oligosaccharides is regarded as 100%. 
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[0040] The enzyme protein exhibits a reactivity of 46% toward a structure of GnT-ll product type oligosaccharides in 
which fucose is attached via a1 ~-> 6 linkage to the GlcNAc at the reducing terminus. 

[0041] The enzyme protein exhibits a reactivity of 0% toward a structure of GnT-ll product type oligosaccharides in 
which the GlcNAc on the a1 -» 3 mannose is lacking. 

[0042] The enzyme protein exhibits a reactivity of 1 6% toward a structure of GnT-ll product type oligosaccharides in 
which galactose is attached via pi -> 4 linkage to the GlcNAc on the <x1 -> 6 mannose, and a reactivity of 0% toward a 
structure of GnT-ll product type oligosaccharides in which galactose is attached p1 -» 4 linkage to the GlcNAc on the 
a1 3mannose. 

[0043] The enzyme protein exhibits a reactivity of 0% toward a structure of GnT-ll product type oligosaccharides in 
which GlcNAc is attached via 01 -> 4 linkage to the pi -> 4 mannose. 

(3) Molecular Weight 

[0044] About 66 K as determined by SDS-PAGE (under non-reducing conditions). About 60 K after treatment with 
peptide N-glycosidase F. Since a shift of band is observed when peptide N-glycanase is used, the enzyme protein is 
thought to be a glycoprotein. 

[0045] The apparent molecular weight as determined by filtration with a gel containing Triton X-100 is 77 K. Thus, it 
is thought that GnT-IV does not have a subunit structure and functions as a monomer. 

[0046] The protein moiety of this enzyme deduced from the. nucleotide sequence thereof consists of 535 amino acid 
residues and has a molecular weight of 61614. 

(4) Optimum pH 

[0047] The optimum pH for reaction is about 5.5. More than 50% of the maximum activity is observed in the range 
from pH 6.5 to 8.0. 

(5) Inhibition, Activation and Stabilization 

(i) Inhibition 

[0048] The activity of this enzyme is inhibited by addition of 20 mM EDTA. 

[0049] This enzyme is inhibited by UDP derivatives. The intensity of inhibition is in the following order: UDP»UDP- 
GloUDPTGatNAc»2'-deoxy UDP>UDP-hexanolamine»UDP-Gal>UTP>UDP-glucuronic actd>UMP. 
[0050] Uridine. TDP and CDP do not have inhibitory effect. 

(ii) Activation 

[0051] A divalent cation is essential for expression of the activity. Among divalent cations, Mn 2 * shows the greatest 
effect. At a concentration of 7.5 mM, the respective effects of Co 2+ and Mg 2 * are about 70% of that of Mn 2 *, and the 
effect of Ca 2+ is about 10% of that of Mn 2 *. The effect of Mn 2+ is greatest in the range from 5 to 20 mM. 

(iii) Stabilization 

[0052] Stabilizing effect is recognized in BSA and glycerol. 

(6) Kinetic Parameters 

[0053] When the saccharide as a receptor is an oligosaccharide (for the structures of oligosaccharide, see Fig. 4): 

(i) under assay conditions in which the enzyme is reacted in 50 ul of 125 mM MOPS buffer (pH 7.3) containing 0.8 
mM receptor substrate. 20 mM UDP-GtcNAc, 7.5 mM MnCfe. 200 mM GlcNAc. 0.5% (w/v) Triton X-100, 10% glyc- 
erol and 1% BSA at 37°C for 4 hours: 

Km and Vmax values toward GnT-ll product type oligosaccharide are 0.73 mM and 3.23 uM/min, respectively. 
Km and Vmax values toward GnT-V product type oligosaccharide are 0.13 mM and 1.75 uM/min, respectively. 



When GnT-ll product type oligosaccharide is the receptor substrate, Km value toward UDP-GIcNAc is 0.22 
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mM. 

(ri) under assay conditions in which the enzyme is reacted in 125 mM MOPS buffer (pH 7.3) containing 120 mM 
UDP-GIcNAc, 7.5 nM MnCI 2 , 0.5% (w/v) Triton x-100, 10% glycerol and 1% BSA at 37°C for 4 hours: 

s Km and Vmax values toward GnT-ll product type oligosaccharide are 0.59 mM and 0.74 mM/min/mg. respec- 

tively. 

Km and Vmax values toward GnT-V product type oligosaccharide are 0.14 mM and 0.47 mM/min/mg. respec- 
tively. 

10 (7) GnT-IV Family 

[0054] The homology between bovine GnT-lva and human GnT-IVa is 91% at the nucleic acid level and 96% at the 
amino acid level. 

[0055] All of the partial amino acid structures contained in the purified GnT-IV from bovine small intestine are encoded 

is in the bovine GnT- IVa gene. 

[0056] Human GnT-lVb and human GnT-IVa have 63% homology at the nucleic acid level and 62% homology at the 
amino acid level. However, they are entirely different in the C-terminal and N-terminal regions. 
[0057] From the biochemical properties described above, the GnT-IV of the invention has been recognized as a novel 
enzyme in the point that this enzyme is able to perform the following reaction which conventional enzymes cannot per- 

20 form: 



UDP-GIcNAc 



UDP 



25 



— Man a 1 - 



GlcNAc 0 I — 2Man o 1 ' 



— Man a 1^ 



\ Man 0 I 



-GlcNAc 0 1^ 3 Man 0 1 

% Man o 



GlcNAc 0 



30 



Brief Description of the Drawings 

35 

[0058] 



Fig. 1 shows the biosynthetic pathway of Asn-linked sugar chains. 

Fig. 2 shows variations of the Asn-linked sugar chain (revised from Fig. 1 in Makoto Takeuchi, Wako Purechemical 
40 Newsletter 64, 1 8-1 9. 1 996). 

a. Mannan type: a sugar chain structure characteristic of fungi such as yeasts and molds. 

b. Xylo-high-mannose type: a structure characteristic of plants, mollusks and insects. 

c. High mannose type: a structure commonly seen in plants, insects and animals. 
45 d. Hybrid type: a structure commonly seen in insects and animals. 

e. Complex type: a structure characteristic of animals. 

f. Prokaryotic cells: have no system for biosynthesis of Asn-linked sugar chains. 



so The portion boxed with dotted lines represents the common core sugar chain. 

Fig. 3 shows the positions of GlcNAc transfer by various GlcNAc transferases (GlcNAc glycosyltransferases). 

Fig. 4 shows the designations and structures of oligosaccharides. 

Fig. 5 shows a high performance liquid chromatogram for GnT-IV reaction products. 

Fig. 6 shows the results of analysis of GnT-IV by Q-Sepharose FF chromatography. 
55 Fig. 7 show the results of analysis of GnT-IV by copper chelate Sepharose FF chromatography. 

Fig. 8 shows the results of analysis of GnT-IV by UDP-Hexano!amine Agarose affinity chromatography (I). 

Fig. 9 shows the results of analysis of GnT-IV by UDP-Hexanolamine Agarose affinity chromatography (II). 

Fig. 10 shows the results of analysis of GnT-IV by Superdex 200 gel chromatography. 
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Fig. 11 is a photograph showing the results of SDS-PAGE (SDS polyacrylamide gel electrophoresis) of purified 
GnT-IV. 

Fig. 12 shows the results of native gel electrophoresis of purified GnT-IV and the activity thereof. 

Fig. 13 shows Smith degradation profile of GnT-IV, -V and VI product type oligosaccharides. 

Fig. 14 shows the results of 1 H-NMR (30°C) of the GnT-IV reaction product. 

Fig. 15 shows the optimum pH for GnT-IV. 

Fig. 16 shows the optimum Mn 2+ concentration for GnT-IV. 

Fig. 1 7 shows the results of analysis by SDS-PAGE and f luorochromatography of glycoproteins which are the reac- 
tion products of GnT-IV. 

Lanes 1 and 2: 7.6 fig of asialo agalacto human transferrin 
Lane 3: 7.6 ug of asialo human transferrin 

Lanes 4 and 5: 2.8 \iq of asialo agalacto, CHO cell-derived recombinant human erythropoietin 
Lanes 6 and 7: 1.3 ug of asialo agalacto fetuin 

Lanes 1, 4 and 6 represent mock experiments in which reaction was performed without GnT-IV. M represents 
molecular markers (Bio-Rad). PM represents pre-stained molecular markers (Bio-Rad. USA). 

GnT-IV reaction conditions: To 10 pi of a solution containing 0. 702 mnol/hr of GnT-IV, a substrate glycoprotein 
equivalent to 1 .6 nmol of biantennary type sugar chains (for fetuin alone, the sugar chain content was 1.6 nmol) 
and 450 na of UDP-{ 14 CJ GlcNAc. an equal volume of an assay mixture (250 mM MOPS buffer, pH 7.3, 400 mM 
GlcNAc, 20% glycerol, 1 .0% (w/v) Triton X-100, 15 mM Mna 2 , 1 mM UDP-GlcNAc) was added to obtain a reaction 
solution, which was incubated at 37°C for 20 hours. One tenth of the resultant solution was analyzed by SDS-PAGE 
andfluorography. 

For SDS-PAGE, 10-20% gradient gel (Daiichi Kagaku) was used. For fluorography, Amplify (Amersham) was 
used to expose the samples to X ray film for 20 hours. The visualization of biantennary sugar chains in the glyco- 
protein was performed using ConA-HRP and POD-lmmunostain Kit (Wako Purechemical) after dot-Wotting onto a 
PVDF membrane. 

Fig. 18 shows the open rearing frame of human GnT-IVa and the region contained in pCore-His expression vector. 
Fig. 1 9 shows the results of isoelectric focusing and Western analysis of erythropoietins produced by individual cell 
clones. Using two erythropoietin-producing strains and the same strains into which bovine and human GnT-IVa 
genes were introduced, respectively, the erythropoietin secreted by each strain was analyzed by isoelectric focus- 
ing and Western blotting using anti-erythropoietin antibody. On the left side, the positions of pi markers are shown. 

Best Modes for Carrying Out the Invention 

[0059] Hereinbelow. the present invention will be described in more detail with reference to the following Examples. 
However, the present invention is not limited to these Examples. 

[Reference Example 1] 

(1) Reagents Used in the Examples 

[0060] Unless indicated otherwise, the reagents used were the highest grade products manufactured by Wako 
Purechemical Industries, Ltd. 

(i) Pyridylaminated Oligosaccharides 

[0061] Each of the pyridylaminated oligosaccharides used was obtained as described below. First, pyridylaminated 
oligosaccharides were prepared from human transferrin (apo type; Sigma, USA) according to the method of Tokugawa 
et al. [Biehuizen, M.R. Mattel. M.G. and Fukuda, M. (1993) Genes Dev., 7, 468-478], The resultant material was treated 
with one or a combination of the following enzymes: Arethrobacter ureafadens4&\ved sialidase (Nacalai Tesque). 
Aspergillus sp.-derived p-galactosidase (Toyobo). Jack bean-derived p-N-acetylhexosaminidase (Seikagaku Corp.). 
GnT-V active fraction in CHO-K1 cell extract (supernatant obtained by sonication of CHO-K1 cells in 2 volumes of 5 mM 
Tris-HCI buffer, pH 7.5, containing 2 mM Mgd 2 1 mM PMSF . and centrifugation at 900xg for 10 min), and the 
GnT-V active fraction in the solubilized fraction from bovine small intestine homogenate (for the method of preparation, 
see "Preparation of the Microsome Fraction" and "Solubilization" in Example 1). A part of the pyridylaminated oligosac- 
charides were prepared by treating PA-Sugar Chain 021 and 022 (Takara Shuzo) with the above enzymes. In both 
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cases, the oligosaccharides prepared were purified by reversed phase chromatography using an ODS column (10 x 
250 mm; Vydac. USA) before use. 

(ii) Glycoprotein Substrates 

[0062] Bovine fetuin (Sigma, USA) and CHO cell-derived recombinant human erythropoietin (Kirin Brewery) were 
subjected to the following pretreatment to purify them into relatively uniform glycoform. Briefly. 40-100 mg of the glyco- 
protein was applied to a ConA-Sepharose column (5 ml; Pharmacia, Sweden) equilibrated with 10 mM Tris-HCI buffer, 
pH 7.4. containing 1 mM MgCI*,. 1 mM CaC^ and 0.15 M NaCt to thereby obtain a glycoform with a low biantennary 
sugar chain content as the non-adsorbed fraction. Thereafter, the column was eluted with the above buffer containing 
1.0 M a-methyl mannoside (Nacalai Tesque) to thereby obtain the fraction adsorbing a glycoform with a high bianten- 
nary sugar chain content. Thus, fetuin with a low biantennary sugar chain content and erythropoietin with a high biant- 
ennary sugar chain content were obtained. With respect to human transferrin, there is no need to purify this 
glycoprotein since almost all sugar chains thereof are biantennary. 

10063] The thus obtained fetuin and human transferrin were individually reacted with 1 U of sialidase and 0 or 1 07 U 
of p-galactosidase in 1 ml of 0.4 M sodium acetate buffer, pH 5.0, containing 4 mM MgCI 2 at 37°C for 16 hours to 
thereby obtain asialo or asialo agalacto glycoproteins. The erythropoietin with a high biantennary sugar chain content 
was reacted with 0.5 U of sialidase and 5 U of p-galactosidase in the same manner as described above to obtain an 
asialo agalacto glycoprotein. 

[0064] Each of the thus obtained glycoprotein substrates was dialyzed against 50 mM ammonium acetate buffer, pH 
7.3. Then, the amount of protein was determined with BCA protein assay (Pierce, USA) using BSA (bovine serum albu- 
min) as a standard Further, the protein was analyzed by SDS-PAGE (sodium dodesyl sulfate polyacrylamtde gel elec- 
trophoresis). The thus prepared glycoproteins were used in the Examples. 

(iii) RT-PCR (Reverse Transcription-Polymerase Chain Reaction) 

[0065] For RT-PCR. Access RT-PCR System (Promega, USA) was used. For the amplification of a fragment of a gene 
of interest, Pfu polymerase (Stratagene. USA) was used. 

(2) Equipment used in the Examples 

(i) Gene Sequencing 

[0066] ABI PLISM 377 DNA Sequencer (Perkin-Elmer, USA) was used. 
[REFERENCE EXAMPLE 2] Specific Assay for GnT-IV Activity 

[0067] Generally, there are two methods for assaying GnT-IV activity: a method in which the transfer of radiolabeled 
GIcNAc to an oligosaccharide substrate is examined and a method in which the transfer of GIcNAc to a labeled oli- 
gosaccharide substrate is fractionally analyzed by HPLC or the like. Taniguchi et al. developed a method in which GnT- 
III, -IV and -V activities are simultaneously determined using the GnT-ll product type oligosaccharide as a receptor 
[Nishikawa. A., Fujii. S., Sugiyama. T and Taniguchi, N. (1988) Anal. Biochem, 170, 349-354]. However, this assay 
method as it is was inappropriate for the assay during purification of GnT-IV because the relative activity of GnT-IV is 
much lower than those of GnT-III and -V. 

[0068] Then, the present inventors have developed a method fa determining GnT-IV activity quantitatively and sen- 
sitively by increasing the amount of the acceptor pyridylaminated oligosaccharide to 10-foW compared to the amount 
used in the previous assay system [Tokugawa, K.. Oguri, S. and Takeuchi, M. (1996) Giycoconjugate J., 13. 53-56]. 
Generally, it is very difficult to prepare such a large amount of acceptor oligosaccharides. However, according to the 
method of Tokugawa et al. [Tokugawa, K., Oguri. S. and Takeuchi, M. (1996) Giycoconjugate J., 13, 53-56], such oli- 
gosacchardes are readily prepared. 

[0069] In Examples of the present invention, GnT-IV activity was assayed as descrtoed below. 
[0070] The enzyme was reacted in 1 25 mM MOPS [3-(N-morpholino)propane-sulfonic acid] buffer, pH 7.3, containing 
0.8 mM pyridylaminated oligosaccharide substrate (GnT-ll product type oligosaccharide substrate). 20 mM UDP-GIc- 
NAc. 7.5 mM MnCI 2 . 200 mM GIcNAc. 0.5%(w/v) Triton X-100. 10% glycerol and 1% BSA at 37°C for 4 hours. Then, 
the reaction was terminated by boiling the solution for 2 minutes. After removal of solids with a 0.45 nm filter. 5 uJ of the 
filtrate was analyzed with an ODS-80TM column (4.6 x 1 50 mm; TOSO) (Fig. 5) at 50°C with 50 mM ammonium acetate 
buffer. pH 4.0. containing 0.15% (w/v) n-butand at a flow rate of 1 .2 ml/min. The fluorescence of pyridylamino groups 
was detected using excitation at 320 nm and emission at 400 nm. 
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[EXAMPLE 1] Isolation and Purification of the Enzyme 
(1) Screening of a Source of the Enzyme 

5 [0071 ] A source of the GnT-IV enzyme to be purified was searched for by utilizing the assay method described above. 
It was found that the relative activity of GnT-IV to those of GnT-lll and GnT-V in bovine small intestine is rather higher 
than the relative activities of GnT-IV in any other tissues as shown in Table 1 . Thus, bovine small intestine was selected 
as a starting material for purif ication. 



10 




Table 1 










Search for Sources of GnT-IV Enzyme 


to 




Specific Activity (pmol/h * mg- 
protein) 




Source of the Enzyme 


IV 


III 


V 




Cultured cells 


CHO 


10.8 


0 


1097 






Bowes 


12.0 


341 


150 


20 




AH66 1) 


2.0 


634 


30 






Solid AH 1 ) 


27 


116 


80 






Yoshida sarcoma 1 ) 


1.3 


70 


109 


25 


Rat organs 1 ) 


Small intestine 


17 


280 


68 






Heart 


9.4 


11 


10 






Spleen 


20 


100 


21 






Kidney 


1.9 


1840 


30 


30 




Brain 


3.7 


660 


38 




Human 1 ) 


Liver 


2.8 


8.1 


8.2 




Bovine organs 


Small intestine 


25 


174 


41 


35 




Heart 


N.D. 


N.D. 


N.D. 






Spleen 


10.9 


0.7 


10.9 




Colostrum 




N.D. 


N.D. 


N.D. 




N.D.: below detection limit 



*> : data from Nishikawa. A et al.. BBA 1035. 313-318 (1990) 



(2) Purification 

45 [0072] Unless otherwise indicated, all the operations were performed at 4°C. 

(i) Preparation of the Microsome Fraction 

[0073] Two kilograms of bovine small intestine (obtained from a meat processor) was minced. Then. 4 volumes of an 
so extraction buffer (10 mM Tris-HCI buffer, pH7.4, containing 0.25 M sucrose. 1 mM phenylmethylsulfonyl fluoride. 1 mM 
benzamidine hydrochloride. 1 mM dithiothreitol and 10 mg/ml antipain) was added thereto and homogenized with Pol- 
ygon (Kinematica. Sweden). The resultant homogenate was centrifuged at 900xg for 10 minutes. Then, the superna- 
tant was centrifuged further at 105.000xg for 60 minutes to thereby obtain the microsome fraction as a precipitate 
(Sample 1). 

55 

(ii) Solubilization 



[0074] Sample 1 was suspended in 3 volumes of a solubilization buffer prepared by adding Triton-1 00 to the extraction 
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buffer to give a final concentration of 1%. The supernatant was obtained by the centrifugation at 105 f 000 xg for 60 min- 
utes. The pellet was suspended again and collect the supernatant. The first and second extracts were combined (Sam- 
ple 2). 

5 (in) Q-Sepharose FF Chromatography 

[0075] Sample 2 was applied to Q-Sepharose FF Column (5 x 30 cm; Pharmacia, Sweden) pre-equilibrated with oper- 
ation buffer 1 (20 mM Tris-HCI. pH 7.4, containing 1 mM benzamidine hydrochloride. 0.1% Triton X-100 and 20% glyc- 
erol) and then eluted by a linear gradient of 0-0.5 M KCI (Fig. 6)(Sample 3). 

10 

(iv) Copper Chelate Sepharose FF Chromatography 

10076] Sample 3 was applied to Copper Chelate Sepharose FF Column (5 x 10 cm; Pharmacia, Sweden) pre-equili- 
brated with operation buffer 2 (obtainable by adding KCI to operation buffer 1 at a final concentration of 0.15 M). Then, 
is non-adsorbed fractions were washed out with 5 volumes of operation buffer 2. Thereafter, the adsorbate was eluted by 
a linear gracfient of 0.01 M glycine (Rg. 7). The resultant GnT-IV active fraction was pooled and concentrated with YM30 
ultrafiltration membrane (Anracon, USA) (Sample 4). 

(v) UDP-Hexanolamine Agarose Affinity Chromatography I 

20 

[0077] To UDP-Hexanolamine Agarose Affinity Column (1.2 x 4.5 cm; Sigma, USA) pre-equilibrated with operation 
buffer 3 (20 mM Tris-HCI, pH 8.0, containing 0.15 M KG, 10 mM MnClg, 0.05% Triton X-100 and 20% glycerol), one half 
of Sample 4 dialyzed against 1 mM benzamidine hydrochloride-added operation buffer 3 was applied. Then, non- 
adsorbed fractions were washed out with operation buffer 4 (20 mM Tris-HCI, pH 8.0, containing 10 mM MnCfe. 0.05% 
25 Triton x-100 and 20% glycerol). Thereafter, the adsorbate was eluted with operation buffer 4 to which 1 M (final concen- 
tration) KCI had been added (Rg. 8). The GnT-IV active fraction was pooled and dialyzed against operation buffer 5 
(having the same composition as that of operation buffer 4 but having a pH of 7.4) (Sample 5). 

(vi) UDP-Hexanolamine Agarose Affinity Chromatography II 

30 

[0078] Sample 5 was applied to UDP-Hexanolamine Agarose Affinity Column (1 .0 x 6.5 cm; Sigma, USA) pre-equil- 
ibrated with operation buffer 5. Then, non-adsorbed fractions were washed out with operation buffer 5. Thereafter, the 
adsorbate was eluted with MnCfe-removed operation buffer 5 (Fig. 9). The resultant GnT-IV active fraction was pooled 
(Sample 6). 

35 

(vii) Superdex 200 Gel Chromatography 

[0079] Sample 6 was concentrated with a small Q-sepharose FF column and applied to Superdex 200HR5/5 Column 
(1 x 30 cm; Pharmacia, Sweden) pre-equilibrated with operation buffer 6 (obtained by adding KCI to operation buffer 5 
40 at a final concentration of 0.15 M) (Fig. 10). Operation buffer 6 was applied to the column at a flow rate of 0.25 mi/min 
to thereby obtain the GnT-IV active fraction (Sample 7). 

(viii) The amount of protein, activity and specific activity in each purification step are summarized in Table 2. The final 
sample was purified 224,000-fold compared to the small intestine homogenate. 

45 



50 



55 



[0080] 
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Table 2 



Purification of GnT-IV 


Purification Step 


Amount of protein 

(mg) 


Total enzyme 
activity (nmoi/h) 


Specific activity 
(nmol/h/mg) 


Yield (%) 


Purification factor 
(-fold) 


Bovine small 
intestine homoge- 
nate 


112,900 


49,500 


0.44 


100 


1 


Solubilized frac- 
tion 


24.100 


14,500 


0.60 


29 


1.4 


Q-Sepharose 


4,000 


7,200 


1.80 


14 


4.1 


Cu Chelate 
Sepharose 


450 


3.670 


8.10 


7.4 


18.4 


UDP-Hex- 
anolamine I 


0.59 


1.950 


3,310 


3.9 


7,510 


UDP-Hex- 
anoiamine II 


0.035 


1,420 


40.600 


2.9 


92,200 


Superdex 200 


0.008 


790 


98,800 


1.6 


224,000 


Started from 2 kg of bovine small intestine. 



(3) Properties in terms of Enzyme Chemistry and Protein Chemistry 

(i) Purity 

[0081] Sample 7 gave a single band at a molecular weight of 60 Kin SDS-PAGE (Fig. 1 1). When Sample 7 was sub- 
jected to native-PAGE and the resultant band was cut out from the gel to determine GnT-IV activity, the location of the 
protein band agreed with the location of the activity (Fig. 12). Furthermore, any of GnT-l. -II -III or -V activity was not 
detected in Sample 7. From these findings, it was concluded that Sample 7 is pure GnT-IV. Taking into account that the 
apparent molecular weight of this protein was 77 K as determined by Triton X-100-containing gel filtration (Fig. 10). it is 
thought that GnT-IV does not have a subunit structure and expresses its activity in the form of a monomer. When Sam- 
ple 7 was treated with Peptide N-Glycosidase F (Boehringer-Mannheim, Germany), an increase in mobility was 
observed on SDS-PAGE. Thus, it is thought that GnT-IV from bovine small intestine is a glycoprotein having at least 
Asn-J inked sugar chains. 

(ii) Reaction Specificity 

[0082] When this enzyme reacts on the GnT-ll product type oligosaccharide represented by the formula below as a 
substrate: 



GlcNAc 0 1— 2Man al N6 
GtcNAc£i— 2Manal^ 



3 Man £ l— 4GicNAc p l— 4GlcNAc-PA 



under the standard assay conditions, the enzyme yielded a single product (pyrtdylaminated oligosaccharide 1) as 
assayed by HPLC. 

[0083] This product was collected, followed by determination of its structure by (0 a combination of Smith degradation 
and laser TOF-MS (time-of-f light mass spectrometer) and (ii) 1 H-NMR. Thus, the reaction specificity of this enzyme was 
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examined. When p/ridylaminated oligosaccharide 1 was subjected to Smith degradation according to the method of 
Kobata and Takasaki [Kbbata. A. and Takasaki, S. (1993) in GJycobiology "A Practical Approach" (Fukuda. M. and 
Kobata. A., eds) 165-185, IRL Press, Oxford, England], its mass number changed from 1599.0 to 795.30 as a result of 
the first degradation and further changed to 634.68 as a result of the second degradation. This agrees to the reaction 
pathway as shewn in Fig. 13. Thus, it was concluded that the reaction product of this enzyme has the following struc- 
ture: 



GlcNAc jS 1— 2 Man a 1^ . ^ „ A «a 

GlcNAc/n ± jMan ft l-4GlcNAc /? l-4GlcNAc -PA 

^Man a V 
GlcNAc I 2 



[0084] Further, when pyridylaminated oligosaccharide 1 was subjected to 1 H-NMR, a peak of 4.53 ppm which corre- 
sponds to the anomeric proton of GlcNAc7 shown in the following formula was detected; its coupling constant J1 ,2 was 
7.9 Hz (Fig. 14). These results indicate that GlcNAc7 is, as shown in the formula below, attached to position 4 of Man4 
via p-type linkage, supporting the above structure completely. 

GlcNAc-^Man ^ 7 - , 
7 6 $ L 1 

GlcNAc 3 Man— 4GlcNAc— 4GlcNAc-PA 

GlcNAc-^Man ' 
5 2 4 



(iii) Optimum pH 

[0085] The optimum pH of this enzyme is around 7.5 as shown in Fig. 15. 

(iv) Requirement of Divalent Cation 

[0086] As shown in Table 3, this enzyme is deactivated by the addition of EDTA (ethylenediamine tetra-acetic acid). 
A divalent cation is essential for 'tis activity. Among divalent cations, Mn 2 * shows the greatest effect, followed by Co 2 * 
and Mg 2 *. Weak effect is recognized in Ca 2+ and Fe 2+ . The optimum concentration of Mn 2 * is around 10 mM as shown 
in Fig. 16. 



Table 3 



Divalent Cation Require- 
ment of GnT-IV 


Additive 


Activity (%) 


None 


5.6 


EDTA 


0 


MnCfe 


100 


CoCI 2 


74.8 


MgCfe 


72.5 


CaCl2 


7.2 
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Table 3 (continued) 



Divalent Cation Require- 


ment of GnT-IV 


Additive 


Activity (%) 


FeCfe 


9.8 


CuCfe 


0 



70 I0087] GnT-IV activity was determined by adding each of the metal ions (10 mM) to a GnT-IV sample from which metal 
ions had been removed. GnT-IV activity is represented in per cent in the Table, wherein the activity when 10 mM MnCI 2 
was added is regarded as 100%. 

(v) Inhibition by Sugar Nucleotides 

15 

[0088J As shown in Table 4, UDP inhibited the activity of this enzyme most strongly. The inhibitory effects of UDP- 
glucose, UDP-GalNAc, 2'-deaxy-UDP and UDP-hexanoiamine (Sigma. USA) followed that of UDP in this order. Uridine. 
UMP, TDP and CDP exhibited little inhibitory effect 

20 

Table 4 



Inhibition of GnT-IV by Nucleotides 


Additive 


Activity (%) 


None 


100 


Uridine 


115 


UMP 


97.3 


UDP 


27.3 


UTP 


88.2 


TDP 


110 


CDP 


112 


2*-deoxy-UDP 


67.4 


UDP-hexanolamine 


73.6 


UDP-glucose 


56.6 


UDP-galactose 


87.3 


UDP-glucuronic acid 


92.3 


UDP-N-acetylgalactosamine 


59.7 



45 [0089] GnT-IV activity when each nucleotide (2mM) was added in the presence of 0.5 mM UDP-GIcNAc is expressed 
in per cent in the Table, wherein the activity when nothing was added is regarded as 100%. 

(vi) Substrate Specificity 

so [0090] As shown in Table 5, this enzyme preferred the GnT-V product type oligosaccharide (E in Table 5) the most as 
an acceptor. Next to this, the enzyme preferred the GnT-ll product type oligosaccharides (D in Table 5). 
[0091] When the reactivity of this enzyme toward the GnT-ll type oligosaccharide is regarded as 100%. this enzyme 
exhibits reactivities of 0% and 54% toward the core type oligosaccharides (A in Table 5) and the GnT-l product type oli- 
gosaccharides (C in Table 5). respectively. 

55 [0092] This enzyme exhibits a reactivity of 46% toward a structure of GnT-ll product type oligosaccharide in which 
fucose is attached via a1 -> 6 linkage to the GlcNAc at the reducing terminus (F in Table 5). 

[0093] This enzyme exhibits a reactivity of 0% toward a structure of GnT-ll product type oligosaccharides in which the 
GlcNAc on the a1 -> 3 mannose is lacking (B in Table 5). 
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[0094] This enzyme exhibits a reactivity of 16% toward a structure of GnT-ll product type oligosaccharides in which 
galactose is attached via p1 -> 4 linkage to the GlcNAc on the a1 -> 6 mamose (G in Table 5). and a reactivity of 0% 
toward a structure of GnT-ll product type oligosaccharide in which galactose is attached via p1 -> 4 linkage to the Glc- 
NAc on the a1 -» 3 mannose (H and I in Table 5). 
s [0095] This enzyme exhibits a reactivity of 0% toward a structure of GnT-ll product type oligosaccharides in which 
GlcNAc is attached via pi -> 4 linkage to the p1 4 mannose (J in Table 5). 

10096] The substrate specificity of this enzyme as described above almost agree with the substrate specificity of GnT- 
IV predicted by Schachter et al. [Glesson. PA and Schachter. H. (1983) J. Biol. Chem., 258, 6162-6173]: Thus, it has 
become clear that this enzyme of the invention is the very GnT-IV that has long been a missing link in the biosynthesis 
io of complex type sugar chains. 



55 



15 



20 



30 



35 



40 



45 
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Table 5 



Relative Activity of 

5 

Receptor Oligosaccharide GlcNAc Transfer (%) 

A Man a i _ 

6 Man 0 l — *GlcNAc 0 * 4GlcNAc-PA 0 

Man a l 

B GlcNAc 0 i — 2Man a l ^ 0 

| Man 13 l — 4GlcNAc 0 i — 4GlcNAc-PA 

Man a l ^ 



Man a l 54 
6 Man 0 l - <GlcNAc 0 l - 4GlcNAc-PA 
GlcNAc 0 i 2Man a I -* 3 

GlcNAc 0 l— 2Man a l ^ 

* Man 0 ! - 4GlcNAc 0 l - 4ClcNAc-PA 100 

GlcNAc 0 l — » 2Man a l 



„ E GlcNAc 0 l ^ 

25 |Man ^ 

GlcNAc Ui^* * Man 0 l — 4GlcNAc fi 1 — 4GlcNAc-PA 164 



GlcNAc 0 l — - 2Man 

Fuc c I 
1 

GlcNAc 0 l — zMan a i ^ 6 

° Man 0 i - 4GlcNAc 0 i - 4GlcNAc-PA 

GlcNAc 0 i — > 2Man a \^ 



46 



tMan fi i — 4GIcNAc 0 I — 4GlcNAc-PA 

3 



G Gal 0 l — 4GlcNAc 0 i — 2Man a x - 

GlcNAc 0 i — 2Man <x V** 16 
H GlcNAc 0 l — 2Man a 

®Man 0 i - 4GlcNAc 0 l - 4GlcNAc-PA 

Gat 0 l — 4GlcNAc 0 l — 2Man a l * 
I Gal fi l — 4GlcNAc 0 l — 2Man aK 

^Man 0 i - 4GlcNAc 0 I - 4GlcNAc-PA 

Gal fix—* <GlcNAc 0 i — 2Man a \<> 

J GlcNAc 0 i — 2Man ai N 6 

GlcNAc 0 1 — 4 Man 0 t — <GlcNAc 0 i — ^GIcNAc-PA 
GlcNAc 0 i — 2Man a i ^ 3 



50 



(vii) Kinetic Parameters 

55 

[0097] Under the assay conditions as described in Reference Example 2. the Km and Vmax values of this enzyme 
toward the GnT-ll product type oligosaccharides were 0.73 mM and 3.23 uM/min, respectively, and these values toward 
the GnT-V product type oligosaccharides were 0.13 mM and 1 .75 uM/min, respectively. The Km value toward the UDP- 



5 



10 
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GlcNAc was 0.22 mM. 

{0098] Among the pyridylaminated oligosaccharides obtained, those represented by the following formulas were 
found to be novel oligosaccharides: 

G1CNAC0 1 ^ 4l , 1-4G1CNAC/J I-.GlcNAc.PA 

GlcNAc p\^ 

and 



, 5 Gal p 1—4 GlcNAc p l^2Man a 

GlcNAc p l . 4 -IM** P l~4GlcNAc p l-4GicNAc-PA 

^2 Man a r 



20 



45 



GlcNAc 0 1 



(viii) Action on Glycoproteins 

25 [0099] In order to demonstrate that GnT-IV can act on not only oligosaccharide substrates but also oligosaccharide 
chains on glycoproteins. GnT-IV reacts on asialo agalacto glycoproteins using UDP-[ 14 C] GlcNAc as a sugar donor. 
Then, the reaction products were analyzed by SDS-PAGE and fluorography (Panels A and B ( Fig. 17). As shown in 
lanes 2 and 5 of Panel B in Fig. 17, transfer of I 14 C] GlcNAc to asialo agalacto human transferrin and asialo agalacto, 
CHO cell-derived recombinant human erythropoietin. 

so [0100] The human transferrin having the GnT-IV product type sugar chain (of the following formula) obtained by this 
GnT-IV reaction is a novel substance which does not occur in nature. 

GlcNAc /? I— 2Mana^ 

^ vr A n nMan p l-4GlcNAc Q l— 4GlcNAc 

GlcNAc B l^ 4 . 4 s* 

GlcNAc 8 V 2 

GnT-IV product type sugar chain structure 

40 



[EXAMPLE 2] Peptide Mapping Analysis 



[0101] About 1 mg of this enzyme of the invention finally purified was electrophoresed on 0.1% SDS-10% pdyacry- 
lamide gel according to the method of Laemmli [Laemmfi, U.K. Nature (1970) 313, 756-762]. The separated proteins 
were electroblotted onto a PVDF membrane. The protein fixed on the membrane was S-carboxymethylated and then 
digested with lysylendopeptidase Achromobacter protease I (AP-I) (Wako Pure Chemical Industries. Ltd) to obtain an 
so AP-l-digested fragment mixture. The AP-l-digested PVDF membrane was further digested with Asp-N (Takara Shuzo) 
to obtain an Asp-N-digested fragment mixture. Each of the peptide fragment mixtures was separated by high perform- 
ance liquid chromatography and subjected to amino acid sequence analysis. As a result, the sequences shown in SEQ 
ID NOS: 1-14 were obtained 



55 
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[EXAMPLE 3] Isolation and Identification of Bovine GnT-IVa cDNA 

(1) RT-PCR 

[0102J Based on the amino acid sequences shown in SEQ ID NOS: 7 and 1 1 obtained in Example 2, oligomer AP- 
5F shown in SEQ ID NO: 15 and oligomer DN-9R shown in SEQ ID NO: 16 were synthesized respectively. An RT-PCR 
was performed using as a template the total RNA extracted from bovine small intestine tissue by the guanidium isothi- 
ocyanate method and using the above primers. As a result, an amplified fragment of about 170 bp which seemed spe- 
cific was obtained. This fragment was subcloned. 

(2) Screening of a Library 

[0103] A bovine small intestine cDNA library (Clontech, USA) was screened with the above-mentioned RT-PCR prod- 
uct to obtain four positive plaques. The nucleotide sequences of these clones were determined. The resultant 
sequences contained a number of nucleotide sequences coding for some of the partial amino acid sequences (SEQ ID 
NOS: 1-14) obtained in Example 2, and also contained a sequence appearing to be a termination codon. Using a frag- 
ment of 150 bp representing the most upstream region of the resultant nucleotide sequence, the library was screened 
again to obtain two positive plaques. The nucleotide sequences of these clones were determined. Then, the library was 
further screened similarly with a probe of 150 bp of the most upstream region, however, new clones were not obtained. 

(3) 5* RACE (Rapid Amplif ication of the cDNA Ends) 

[0104] Subsequently, 5' RACE was performed in order to obtain a full-iength cDNA. Using the sequence of the most 
upstream region obtained by the phage screening, the first 5" RACE was performed. However, an initiation codon could 
not be found. Then, based on the sequence obtained by the first 5* RACE, the second 5* RACE was performed to 
thereby obtain a sequence containing an initiation codon. This sequence was ligated to the previously obtained partial 
gene sequence of the phage clone to thereby obtain a gene fragment containing an intact open reading frame (Gene 
1). The nucleotide sequence for the thus obtained gene fragment is shown in SEQ ID NO: 17, and the amino acid 
sequence deduced therefrom is shown in SEQ ID NO: 18. It was confirmed that this DNA fragment contains all of the 
nucleotide sequences cocfing for the 14 partial amino acid sequences (SEQ ID NOS: 1-14) obtained in Example 2. 

[EXAMPLE 4] Construction of an Expression Vector using the Cloned Bovine GnT-IVa Gene and a Method for Produc- 
ing GnT-IVa Enzyme 

(1) Construction of a Vector 

[0105] A primer (SEQ ID NO: 19) which introduces an Xhol site into a region upstream of the initiation codon of Gene 
1 and another primer (SEQ ID NO: 20) which introduces an Xbal site into a region downstream of the termination codon 
of the gene were synthesized. Then, the entire gene coding for GnT-IV enzyme was amplified by PCR with the primers. 
The amplified fragment obtained was digested with Xhol and Xbal, and inserted between the Xhol and Xbal sites of 
pSVL vector (Pharmacia, Sweden) to prepare plasmid pBGT4. 

(2) Introduction into COS7 Cells 

[0106] Plasmid pBGT4 was introduced into COS7 cells (RIKEN Cell Bank) by electroporation. Briefly, 10 *ig of the 
plasmid was added to approximately 5x10 6 cells in 0.8 ml of PBS( ) (Nissui Pharmaceutical Co.). A voltage of 1600 V 
was applied at a capacitance of 25 pf with a gene pulser (BioRad, USA) at room temperature to introduce the gene into 
the cells. The resultant cells were transferred to a 90 mm laboratory dish and cultured in 10 ml of Dulbecco's modified 
Eagle's medium (Base Catalogue No. 12430. Life Technologies. Inc., USA) containing 10% fetal bovine serum under 
5% C0 2 at 37°C for 72 hours. Thereafter, the cells were recovered and suspended in 100 pi of a buffer (5 mM Tris-HCI. 
pH 7.5, 2 mM MgCI 2 , 1 mM phenylmethylsulfonyl fluoride), followed by sonication and centrifugation at 2000 x g for 5 
minutes. Thus, a cell extract was obtained. 

(3) Assay of GnT-IV Activity 

[0107] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The 
results are shown in Table 6. Compared to the extract from ceils into which pSVL vector was introduced as a control, 
the extracts into which plasmid pBGT4 was introduced exhibited 44-78 times higher GnT-IV activity per cell. From these 
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results, it was confirmed that Gene 1 encodes GnT-IV enzyme. Thus, GnT-IV enzyme can be produced by cultured ceils 
according to this method. 



Table 6 



Plasmid 


Specific Activity 
(pmol/hr/mg protein) 


Activity Ratio 


pSVL 


409 


1 


pBGT4(#1) 


29623 


72 


pBGT4{#2) 


31773 


78 


pBGT4(#3) 


20182 


44 


Reaction time: 4 hours 

Activity ratios are expressed in relation to the total activity 
of pSVL that is regarded as 1 . 



[EXAMPLE 51 Isolation and Identification of Human GnT-IVa cDNA 

(1) RT-PCR 

[0108] Based on the nucleotide sequence of bovine GnT-IVa obtained in Example 3. primer hi -2F shown in SEQ ID 
NO: 21 and primer h1-1R shown in SEQ ID NO: 22 were synthesized. Using total RNA from human liver (Clontech. 
USA) as a template, an RT-PCR was performed with the above primers. As a result, an amplified fragment of about 650 
bp which seemed to be specif ic was obtained. This fragment was subcloned. and the nucleotide sequence thereof was 
determined. 

(2) Screening of a Library 

[0109] A cDNA Itorary from human liver (Clontech, USA) was screened using the 685 bp DNA fragment obtained by 
the above RT-PCR as a probe. Two positive plaques of hGT4/X gt10-1 and hGT4A. gt10-2 were obtained. The nucle- 
otide sequences of the inserts in these phage clones were determined. As a result. hGT4A. gt10-1 contained a 804 bp 
DNA region and hGT4A, gt10-2 contained a 21 15 bp DNA region. The former region was entirely included in the latter 
region. As shown in SEQ ID NO: 23. the DNA fragment contained in hGT4A, gt10-2 had an open reading frame (ORF) 
highly homologous to the amino acid sequence of bovine GnT-IVa (96% identical)- From the results described in Exam- 
ple 6. it was confirmed that this ORF is human GnT-IVa gene. The amino acid sequence of this ORF is shown in SEQ 
ID NO: 24. 

[EXAMPLE 6] Construction of an Expression Plasmid for Human GnT-IVa Gene and a Method for Producing Human 
GnT-IVa Enzyme 

(1) Construction of Expression Plasmid pHGT4-1 for Human GnT-IVa Gene 

[0110] A primer (hi -7F; SEQ ID NO: 25) which introduces an Xhol site into a region upstream of the initiation codon 
of human GnT-IVa gene and another primer (hi -7R; SEQ ID NO: 26) which is conplementary to a region downstream 
of the termination codon of the gene were synthesized. Using RNA from human liver (Clontech. USA) as a template, 
the entire gene coding for human GnT-IVa enzyme was amplified by RT-PCR with the above primers. The resultant 
amplified fragment was inserted in the Srf I site of plasmid pCRScript Amp SK(+) (Stratagene. DNA) in the opposite 
direction to the transcription of lacZ gene. Using the resultant plasmid, it was confirmed by nucleotide sequence analy- 
sis that the amplified fragment encodes the amino acid sequence shown in SEQ ID NO: 24. Further, this plasmid was 
digested with Xhol and Sad to obtain an Xhol-Sacl 1.7 kb fragment This fragment was inserted bewteen the Xhol and 
Sad sites of pSVL vector (Pharmacia, Sweden) to prepare an expression plasmid pHGT4-1 for human GnT-IVa gene. 

(2) Introduction of Human GnT-IVa Gene into COS7 Cells 



[01 1 1 ] Plasmid pHGT4-1 was introduced into COS7 cells by electroporation. The resultant cells were cultured under 
10% C0 2 at 37°C for 72 hours. Then, the cells were harvested, suspended in 100 uJ of a buffer (5 mM Tris-HCI. pH 7.5, 
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2 mM magnesium chloride, 1 mM phenyimethylsuttonyt fluoride), disrupted by sonication, centrifuged at 2000xg for 5 
minutes and collect supernatant to obtain a cell extract. 

(3) Expression of Human GnT-IVa Gene in COS7 Cells 

[0112] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The 
results are shown in Table 7. Compared to the extract from cells into which pSVL vector was introduced as a control, 
the extracts from cells into which plasmkj pHGT4-1 was introduced exhibited 21 -28 times higher GnT-IV activity per cell 
From these results, it was confirmed that GnT-IVa gene shown in SEQ ID NO: 23 encodes the glycosyltransferase GnT- 
IV. It was also confirmed that human GnT-IVa enzyme can be produced by cultured cells according to this method. 



Table 7 



Plasmid 


Specific Activity 
(pmol/hr/mg protein) 


Activity Ratio 


pSVL 


1037 


1 


pHGT4-1(#1) 


28951 


28 


pHGT4-1(#2) 


21788 


21 


pHGT4-2(#1) 


11024 


11 


pHGT4-2(#2) 


8029 


8 


Reaction time: 1 .3 hours 

Activity ratios are expressed in relation to 

of pSVL that is regarded as 1. 


the total activity 



[EXAMPLE 71 Isolation and Identification of Human GnT-IVb cDNA 

(1) Acquisition of Human GnT-IVa-like Gene by PCR, RT-PCR and 5RACE (Rapid Amplification of cDNA Ends) 

[0113] Nucleotide sequences having similarity to the nucleotide sequence of human GnT-IVa gene obtained in Exam- 
ple 3 were searched for in the DNA database GenBank by BLASTN. As a result. Accession Numbers R12057, H1 0557 
and W16571 were found out. Then, primer h2-45F shown in SEQ ID NO: 27 and primer h2-43R shown in SEQ ID NO: 
28 were synthesized to perform a PCR using a cDNA library from human brain of Quick Screen Human cDNA Library 
Panel (Clontech. USA) as a template. The amplified fragment was subdoned into the Srf I site of pCRScript Amp SK(+) 
(Stratagene, USA) and subjected to analysis of the nucleotide sequence. Also, primer h2-2F shown in SEQ ID NO: 29 
and primer h2-1 R shown in SEQ ID NO: 30 were synthesized to perform an RT-PCR using total RNA from human lung 
(Clontech, USA) as a template. As a result, an amplified fragment of about 500 bp of the expected size was obtained. 
Then, this fragment was subdoned into the Srf I site of pCRScript Amp SK(+) (Stratagene. USA) and subjected to anal- 
ysis of the nucleotide sequence. 

[01 1 4] The thus obtained nucleotide sequences of the two DNA fragments were overlapping with each other forming 
a region of 1006 bp. In this region, one reading frame which encodes the homologous amino acid sequences to those 
of bovine and human GnT-IVa was recognized. Thus, the existence of a protein relating to GnT-IVa proteins was sug- 
gested. 

{0115] Then, possible nucleotide sequences which may be an upstream sequence to R12057 or a downstream 
sequence to W1 6571 were searched for in the DNA database GenBank by BLASTN. As a result. R1 5554 was found as 
an upstream sequence to R 12057, and W16466 as a downstream sequence to W16571. However, a apparently inap- 
propriate terrranation codon was contained in the ORFs deduced from these nucleotide sequences. Therefore, in order 
to confirm the nucleotide sequences. DNA fragments were obtained by RT-PCR. As primers, h2-1F shown in SEQ ID 
NO: 31. h2-3F shown in SEQ ID NO: 32 and h2-8R shown in SEQ ID NO: 33 were synthesized. With a combination of 
h2-1 F and the h1-1 R described in Example 5. or a combination of h2-3F and h2-8R, an RT-PCR was performed using 
total RNA from human fiver (Clontech. USA) as a template. Amplified fragments of about 550 bp and about 300 bp, both 
coinciding with the expected sizes, were detected. Each of these fragments was subdoned in the Srf I site of pCRScript 
Amp SK(+) to analyze the nucleotide sequence thereof. As a result, it was confirmed that these fragments are respec- 
tively overlapping with an upstream region and a downstream region to the 1006 bp region between h2-45F and h2-1 R 
mentioned abwe. In the figated region of 1361 bp, an ORF was found which encodes 433 amino acids protein having 
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high similarity to the amino acid sequences of bovine and human GnT-IVa proteins. 

[01 1 6] However, when this ORF is compared to the amino acid sequences of GnT-IVa proteins, it was presumed that 
the starting methionine should be present in a region upstream to this ORF. Therefore, the upstream region was 
obtained by S'-RACE using Human Lung S^RACE-Ready cDNA (Clontech. USA). In the first PCR. an anchor primer 
and h2-5R shown in SEQ ID NO: 34 were used as primers. In the second PCR. an anchor primer and h2-3R shown in 
SEQ ID NO: 35 were used as primers. The fragments obtained by S'-RACE were purified, digested with EcoRI and Pstl, 
and then separated by agarose gel electrophoresis. A fragment of about 450 bp was recovered from the gel. This frag- 
ment was inserted between the EcoRI and Pstl sites of pUCl8 vector (Pharmacia, Sweden) to analyze the nucleotide 
sequence thereof. As a result, it was confirmed that this fragment is overlapping with a region upstream of the region 
between h2-1F and h2-8R. In the ligated region of 1758 bp. one ORF was confirmed which encodes 548 amino adds 
protein having high similarity to the amino acid sequences of bovine and human GnT-IVa proteins. The nucleotide 
sequence of this ORF is shown in SEQ ID NO: 36, and the amino acid sequence thereof in SEQ ID NO: 37. From the 
results described in Example 8 below, it was confirmed that this gene is human GnT-IVb gene. 

[EXAMPLE 8] Construction of an Expression Plasmid for Human GnT-IVb Gene and a Method for Producing Human 
GnT-IVb Enzyme 

(1) Construction of Expression Plasmid pHGT4-2 for Human GnT-IVb Gene 

[0117] A primer (h2-4: SEQ ID NO: 38) which introduces an Xhol site into a region upstream of the initiation codon of 
human GnT-IVb gene, and another primer (h2-10R: SEQ ID NO: 39) which introduces an Xbal site in a region down- 
stream to the termination codon of the above gene were synthesized. Using these primers, the entire ORF coding for 
human GnT-IVb enzyme was amplified by RT-PCR with RNA from human lung (Clontech, USA) as a template. The 
amplified fragment was inserted into the Srfl site of plasmid pCRScript Amp SK(+), followed by determination of the 
nucleotide sequence thereof. As a result, it was confirmed that the amplified fragment is coding for the amino acid 
sequence of SEQ ID NO: 37. Further, this plasmid was digested with Xhol and Xbal to obtain an Xhol-Xbal 1 .7 kb frag- 
ment This fragment was inserted between the Xhol and Xbal sites of pSVL vector (Pharmacia, Sweden) to construct 
an expression plasmid pHGT4-2 for human GnT-IVb gene. 

(2) Introduction of Human GnT-IVb Gene into COS7 Cells 

[0118] Plasmid pHGT4-2 was introduced into COS7 cells by electroporation. The resultant cells were cultured under 
10% C0 2 at 37°C for 72 hours. Then, the cells were recovered, suspended in 100 of a buffer (5 mM Tris-HCI, pH 7.5, 
2 mM magnesium chloride. 1 mM phenylmethylsuHonyl fluoride), disrupted by sonication, centrifuged at 2000xg for 5 
minutes and collect supernatant to thereby obtain a cell extract. 

(3) Expression of Human GnT-IVb Gene in COS7 Cells 

[0119] GnT-IV activity in the cell extract was determined by the method described in Reference Example 2. The 
results are shown in Table 7 above. Compared to the extract from cells into which pSVL vector was introduced as a con- 
trol, the extracts from ceils into which plasmid pHGT4-2 was introduced exhibited 8-1 1 times higher GnT-IV activity per 
cell. From these results, it was confirmed that the GnT-IVb gene shown in SEQ ID NO: 36 encodes the glycosyttrans- 
f erase GnT-IV. It was also confirmed that human GnT-IVb enzyme can be produced by cultured cells according to this 
method. 

[EXAMPLE 9] Construction of Expression Plasmids for Bovine GnT-IVa N-Terminal Deletion Mutants and their Expres- 
sion of 

(1) Construction of Expression Plasmids pSiglle93, pSigPro113 and pSigPro142 for Bovine GnT-IVa 

[0120] A primer (XhoEsig: SEQ ID NO: 40) which introduces an Xhol site into a region upstream of the signal 
sequence of human erythropoietin (GenBank Accession Number X02157) and an antisense primer (E4-1R: SEQ ID 
NO:4 1 ) which ligates the C-terminus of the above signal sequence to a part of the bovine GnT-IVa amino acid sequence 
spanning from position 93 (lie) to the end were synthesized to amplify the signal sequence of human erythropoietin by 
PCR. Also, a sense primer (E4-1F: SEQ ID NO: 42) corresponding to the above antisense primer and a primer (4EXPR: 
SEQ ID NO20) which introduces an Xbal site in a region downstream of the termination codon of bovine GuT-IVa gene 
were synthesized to amplify a partial sequence of bovine GnT-IVa gene by PCR. Using portions of the resultant two 
PCR products as a mixed template, a PCR was performed with primers XhoEsig and 4EXPR. The amplified fragment 
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was digested with Xhol and Xbal and inserted between the Xhol and Xbal sites of pSVL vector (Pharmacia, Sweden), 
to thereby construct plasmid pSiglle93 that expresses an amino acid sequence in which the human erythropoietin sig- 
nal is linked to a part of the bovine GnT-IVa amino acid sequence spanning from position 93 to the end. 
[01 21 ] Plasmid pSigProl 1 3 that expresses an amino acid sequence in which the human erythropoietin signal is linked 
to a part of the bovine GnT-IVa amino acid sequence spanning from position 113 (Pro) to the end; or plasmid 
pSigPro142 that expresses an amino acid sequence in which the human erythropoietin signal is linked to a part of the 
bovine GnT-IVa amino acid sequence spanning from position 142 (Pro) to the end was constructed respectively in the 
same manner as described above using E4-2R primer (SEQ ID NO:43) or E4-3R primer (SEQ ID NO; 44) instead of 
E4-1 R primer; and E4-2F primer (SEQ ID NO:45) or E4-3F primer (SEQ ID NO:46) instead of E4-1 F primer. 

(2) Introduction of plasmids expressing Bovine GnT-IVa N-Terminal Deletion Mutants into COS7 Cells 

[0122] Plasmid pSiglle93, pSigProl 13 or P SigPro142 was introduced into COS7 cells by electroporation. The result- 
ant cells were cultured under 10% C0 2 at 37°C for 72 hours. Then, the cells and the culture supernatant were recov- 
ered separately. The cells were suspended in 100 ul of a buffer (5 mM Tris-HCI. pH 7.5, 2 mM magnesium chloride. 1 
mM phenylmethylsulfbnyl fluoride), disrupted by sonication and centrifuged at 2000xg for 5 minutes to thereby obtain a 
cell extract The culture supernatant was concentrated to about 100 pi with Centriplus 30 (Amicon). 

(3) Expression of Bovine GnT-IVa N-Terminal Deletion Mutants in COS7 Cells 

[0123J GnT-IV activity in the culture supernatant and the cell extract was determined by the method described in Ref- 
erence Example 2. The results are shown in Table 8. Compared to the total activity (i.e.. activity in cells + activity in 
supernatant) of the cells into which pBGT4 vector was introduced as a positive control, the total activity of the cells into 
which pSiglle93 was introduced was more than 30%. Furthermore, more than one third of the activity was secreted into 
the culture supernatant. From these results, it was found that the amino acids from the N-terminus to position 92 of the 
bovine GnT-IVa amino add sequence can be deleted while retaining the enzyme activity. It was also shewn that GnT- 
IVa enzyme can be expressed secretively by using an appropriate secretion signal. 



30 



Table 8 



35 



40 



45 



SO 



Plasmid 


Fraction 


Activity (pmol/hr) 


Activity Ratio in Each 
Fraction (%) 


Total Activity Ratio (%) 


pSVL 


Supernatant 


136 


0.5 


1.9 


pSVL 


Cells 


384 


1.4 




pBGT4 


Supernatant 


722 


2.7 


100.0 


pBGT4 


Cells 


26152 


97.3 




pSiglle93 


Supernatant 


3106 


11.6 


31.9 


pSiglle93 


Cells 


5471 


20.4 




pSigPro113 


Supernatant 


312 


1.2 


3.4 


pSigPro113 


Cells 


606 


2.3 




pSigPro142 


Supernatant 


219 


0.8 


2.2 


pSigPro142 


Cells 


381 


1.4 




Reaction time 
The activity rc 
100%. 


: 2.5 hours 

itios are expressed in percent in relation to the total activity of pBGT4 that is regarded as 



[EXAMPLE 10] Construction of Expression Plasmids for Bovine GnT-IVa C-Terminal Deletion Mutants and their Expres- 



sion 



55 



(1) Construction of Expression Plasmids pCGIy499. pCPro465, P CLys432 and pCPro383 for Bovine GnT-IVa 

[0124] A primer (SEQ ID NO:19) which introduces an Xhol site into a region upstream of the initiation codon of bovine 
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GnT-IVa gene and a primer (CGIy499: SEQ ID NO:47) which ligates the termination codon aHer the Gly codon at posi- 
tion 499 and introduce an Xbal site in a region downstream to the termination codon above were synthesized to amplify 
a partial sequence of bovine GnT-IVa gene by PCR. The amplified fragment was digested with Xhol and Xbal. and 
inserted between the Xhol and Xbal sites of pSVL vector (Pharmacia. Sweden). Thus, plasmid pCGIy499 which 

5 expresses the bovine GnT-IVa amino acid sequence up to position 499 (Glycine) was constructed. Using CPro465 
primer (SEQ ID NO:48). CLys432 primer (SEQ ID NO:49) or CPro383 primer (SEQ ID NO:50) instead of CGIy499 
primer three additional plasmids were constructed in the same manner. They were designated pCPro465 (plasmid 
which 'expresses the bovine GnT-IVa amino acid sequence up to position 465 (Proline)); P CLys432 (plasmid which 
expresses the bovine GnT-IVa amino acid sequence up to position 432 (Lysine)); and pCPro383 (plasmid which 

io expresses the bovine GnT-IVa amino acid sequence up to position 383 (Proline)), respectively. 

(2) introduction of plasmids expressing Bovine GnT-IVa C-Terminal Deletion Mutants into COS7 Cells 

10125] Plasmid pCGIy499. pCPro465. pCLys432 or pCPro383 was introduced into COS7 cells by electroporation. 
is The resultant cells were cultured under 10% C0 2 at 3TC for 72 hours. Then, the cells were recovered and suspended 
in 100 u) of abutter (5 mM Tris-HCI. pH 7.5. 2 mM magnesium chloride. 1 mM rjhenylmethylsulfonyl fluoride), disrupted 
by sonication and centrifuged at 2000xg for 5 minutes to thereby obtain a cell extract 

(3) Expression of Bovine GnT-IVa C-Terminal Deletion Mutants in COS7 Cells 

20 

[0126] GnT-rV activity in the cell extract was determined by the method descrbed in Reference Example 2. The 
results are shown in Table 9. Compared to the GnT-IV activity of the extract from cells into which pBGT4 vector was 
introduced as a positive control, that activity of the extract from cells into which pCGIy499. pCPro465. pCLys432 or 
P CPro383 was introduced was 15.2%. 12.1%. 2.8% or 1045% per cell, respectively. From these results, it was shown 
25 that GnT-rV activity can be retained even if the amino acids from position 384 to the C-terminus in the bovine GnT-IVa 
amino acid sequence are deleted. 



Table 9 



45 



50 



55 



30 


Plasmid 


Specific Activity 
(pmol/hr/mg protein) 


Activity Ratio (%) 




pSVL 


77 


1 




pBGT4 


14917 


100 


35 


pCGIy499 


2263 


15 




pCPro465 


1798 


12 




pCLys432 


410 


3 


40 


pCPro383 


15551 


104 




Reaction Time: 2 hours 

Activity ratios are expressed in percent in relation to the total 
activity of pBGT4 which is regarded as 100%. 



[EXAMPLE 1 1] Construction of Plasmids to Express various GnT-IV Genes in E. coli and their Expression 
(1) Construction of E. coli Expression Plasmid for Bovine GnT-IVa 

[0127] A primer (BSP-N- SEQ ID MO:51) which introduces a BspHI site in a region upstream of the initiation codon 
of bovine GnT-IVa gene and another primer (C-Hd: SEQ ID NO:52) which introduces a H/hdlll site in a region down- 
stream of the termination codon were synthesized to amplify the entire open reading frame of bovine GnT-IVa gene i by 
PCR The amplified fragment was digested with BspHI and H/hdlll. and introduced between the A/col ^ " ** es 
of P Trc99A vector (Pharmacia. Sweden) to thereby construct plasmid pEBGT4. Using BSP-sN primer (SEQ ID NO:53) 
instead of BSP-N primer together with C-Hd primer, plasmid pEfle93 was constructed in a similar manner. Further, a 
gene coding for the open reading frame in which His-Tag is added to the C-terminus was amplified using BSP-N primer, 
a primer (CH-Hd- SEQ ID NO:54) which can introduce His-Tag. a termination codon and a HindM site in a region down- 
stream of the C-terminus of bovine GnT-IVa gene and a primer (H-Hd: SEQ ID NO:55) which has His-Tag. a termination 
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codon and a Hind\\\ site, to thereby construct plasmid pEBGT4+His in a similar manner. 

(2) Construction of £ coli Expression Plasmid for Human GnT-IVa Gene and Human GnT-IVb Gene 

[0128] A primer (4aBSPIL94: SEQ ID NO:56) which introduces an initiation codon and an lie codon in a region 
upstream of position 94 (Leu) of the human GnT-IVa amino acid sequence and which can also introduce a BspHI site 
at a region further upstream thereof; a primer (4aCH-Hd: SEQ ID NO: 57) which introduces His-Tag, a termination 
codon and a H/ndlll site in a region downstream of the C-terminal amino acid; and H-Hd primer were synthesized to 
amplify a gene fragment composed a partial sequence of the human GnT-IVa amino acid sequence to which a 
sequence encoding His-Tag is added. The amplified fragment was digested with BspHI and H/ndlll. and inserted 
between the Nco\ and Hind\\\ sites of pTrc99A vector (Pharmacia, Sweden) to thereby construct plasmid pMA4a+His. 
Further, using CP383H-Hd primer (SEQ ID NO:58) instead of 4aCH-Hd primer, plasmid pCore+His was constructed in 
a similar manner (Fig. 1 8). A fragment of human GnT-IVb gene was amplified using a primer (4bBSP-N: SEQ ID NO:59) 
which introduces a BspHI site in a region upstream of the initiation codon of human GnT-IVb gene and 4bSACR primer 
(SEQ ID No: 60), digested with BspHI and Sad, and then inserted between the A/col and Sacl sites of pTrc99A vector 
(Pharmacia, Sweden). Between the Sacl and H/ndlll sites of the resultant plasmid, a partial length of human GnT-IVb 
gene amplified using 4bSACF primer (SEQ ID NO:61) f a primer (4bCH-Hd: SEQ ID NO: 62) which introduces His-Tag 
at the C-terminus of the human GnT-IVb amino ackJ sequence and H-Hd primer, and digested with Sacl and H/ndlll 
was inserted to thereby achieve plasmid pEHGT4-2+His. Further, a partial sequence of human GnT-IVb gene was 
anplified using a primer (4bNCOG91 : SEQ ID NO:63) which introduces an Ncol site and an initiation codon in a region 
upstream of position 91 (Gly) of the human GnT-IVb amino acid sequence, 4bCH-Hd primer and H-Hd primer, digested 
with Afcol and HindUl and then inserted between the Ncol and H/ndlll sites of pTrc99A vector (Pharmacia, Sweden) to 
thereby construct plasmid pMA4tw-His. 

(3) Introduction of Each Expression Plasmid into £ cdi BL21 Strain 

[0129] Each expression plasmid was introduced into competent cells of £ coli BL21 strain prepared by the calcium 
method. The resultant cells were cultured on LB agar plate containing 100 ug/ml ampicillin. The resultant colonies of £ 
coli transformed with each plasmid were inoculated into LB liquid medium and cultured under shaking at 37°C over- 
night. Then, the culture was inoculated into a fresh LB liquid medium to give a concentration of 2%. While the turbidity 
(OD 595 nm) of the culture fluid was about 0.1 to 0.2, IPTG (isopropyl b-D-thiogalactopyranoskJe) was added thereto 
to give a final concentration of 1 mM. The cells were cultured at 37°C for 2 hours or at 25°C for 4 hours. Then, 500 pi 
of the cells was harvested. The cell pellet was suspended in 50 ul of a buffer (5 mM Tris-HCI. pH 7.5. 2 mM MgQ 2 . 1 
mM phenylmethylsulfonyl fluoride), disrupted by sonication and centrifuged at 2000 x g for 5 minutes to obtain a cell 
extract as a supernatant 

(4) Expression of Each Expression Plasmid in £ coli BL21 Strain 

[0130] GnT-IV activity in the cell extract was deter mined by the method described in Reference Example 2. Table 10 
shows the results of the expression of the bovine gene. Although the extract from £ coli cells into which pTrc99A vector 
was introduced as a control had little GnT-IV. the extract from £ coli cells into which pEBGT4 was introduced had def- 
inite GnT-IV activity. From these results, it was demonstrated that GnT-IV enzyme can be produced by £ coli. The His- 
tag sequence added to the C-terminus of bovine GnT-IVa did not influence greatly upon GnT-IV activity. Thus, it was 
shown that an appropriate tag sequence can be added to GnT-IV enzyme. The mutant (pElle93) in which the N-terminal 
92 amino acid are deleted exhibited stronger GnT-IV activity. Thus, the expression of variants of GnT-IV enzyme which 
was confirmed in animal ceils was also shown possible in £ coli. 



Table 10 



Plasmid 


Activity (pmol/hr/mg pro- 


Activity Ratio (%) 




tein) 




pTrc99A 


0 


0 


pEBGT4 


4611 


100 


pEBGT4+His 


3090 


67 
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Table 10 (continued) 



Plasmid 


Activity (pmol/hr/mg pro- 


Activity Ratio (%) 




tein) 




pElle93 


5841 


127 


Reaction time: 3.0 hours 




After IPTG addition, cells were cultured at 37°C for 2 hours. Activity ratios 


are expressed in percent in relation to the total activity of pEBGT4 which rs 


regarded as 100%. 







[0131] Table 1 1 shows the results of the expression of the human gene. Compared to the extract from E. colt cells 
into which pTrc99A vector was introduced as a control, the extract from E. coli cells into which any of the expression 
plasmids was introduced had GnT-IV activity significantly. As shown in bcvine GnT-IVa enzyme, it was also possible in 
human GnT-IVa and GnT-IVb enzymes to delete an N-terminaJ sequence while retaining the activity. Further, the human 
GnT-IVa enzyme which has both N-terminal deletion and C-terminal deletion exhfoited high GnT-IV activity (pCore+His). 
This shows that those portions deleted in this mutant are not essential for GnT-IV activity. 



Table 11 





Plasmid 


Activity (pmol/hr/mg pro- 
ten) 


Activity Ratio (%) 




pTrc99A 


0 


0 


25 


pEBGT4+His 


21390 


637 




pMA4a+His 


3359 


100 




pCore+His 


39766 


1184 


30 


pEHGT4-2+His 


270 


8 


pMA4b+His 


2812 


84 


35 


Reaction time: 4.0 hours 

After IPTG addition, cells were cultured at 25°C for 4 hours. 

Activity ratios are expressed in percent in relation to the total activity of 

pMA4a+His which is regarded as 100%. 



[EXAMPLE 12J Conversion of the Sugar Chain Branching Structure of a Recombinant Erythropoietin (EPO) by Intro- 
ducing Bovine or Human GnT-IVa Gene into EPO-Producing CHO Cells 

40 

(1) Introduction of GnT-IV Expression Plasmid into EPO-Producing CHO Cells 

[0132] EPO-producing CHO cell clones were created according to the method disclosed in Japanese Examined Pat- 
ent Publication No. 2-17156. GnT-IVa expression plasmid pBGT4 or pHGT4-1 was introduced into the resultant cell 

45 clones M01 and H-5 by electroporation. In the introduction, 1 5 pQ of the expression plasmid and 1 .5 \iq of a drug resist- 
ance marker plasmid (pSV2bsr from Kaken Pharmaceutical or pMAMneo from Clontech) were used in mixture. The 
electroporated cells were cultured under 10% C0 2 at 37*C for about 60 hours. Then, Hasticidin S (Kaken Pharmaceu- 
ticals) (final concentration: 10 ng/ml) or geneticin (Life Technologies. Inc.) (final concentration: 500 jig/ml) was added 
to the medium, in which the cells were cultured for another 10 days to 2 weeks. Thus, clones resistant to either of two 

so drugs were isolated. 

(2) Confirmation of the Expression of the Introduced GnT-IV Genes in EPO-Producing CHO Cell Clones 

[01 33] EPO-producing CHO cell clones (initial clones) and individual drug resistant clones were cultured in an appro- 
55 priate scale. Total RNA from each done was purified. Then, RNA dot Wot analysis was performed using a part of GnT- 
IVa gene as a probe to thereby examine the amount of GnT-IVa mRNA. Further, GnT-IV activity expressed in the initial 
clones and the drug resistant clones was determined by the assay described in Reference Example 2. Those clones 
which gave a strong signal in RNA dot Wot analysis and yet exhibited higher GnT-IV activity than the initial clones were 
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selected and used for EPO production. The selected clones had increased GnT-IV activity; for example. M01(bovine 
GnT-IV)#36 exhibited about 104-fold increase over M01 clone, and H-5(human GnT-IV)#23 exhibited about 125-fold 
increase over H-5 clone. 

5 (3) Production of EPO using GnT-IV Gene- Introduced EPO-Producing CHO Cell Clones 

[0134] EPO is expressed secretively into culture fluid. Then. EPO-producing CHO cell clones M01 and H-5, and the 
above-mentioned clones M01 (bovine GnT-IV)#36 and H-5(human GnT-IV) #23 were cultured in roller bottles. First, 
each clone was adhesion-cultured in a growth medium, and then 1.5 x 10 7 cells were transferred to a 850 cm 2 roller 

w bottle containing 200 ml of a growth medium. The cells were cultured under 10% CO z at 37°C for 3 days so that they 
adhered to the bottle uniformly. Thereafter, the growth medium was removed, and the cells were washed with PBS 
buffer. Then t 200 ml of a serum-free medium was added to the bottle, in which the cells were cultured under 10% C0 2 
at 37°C for 7 days. Thereafter, the culture supernatant was recovered. As a growth medium. D-MEM/F12 mixed 
medium supplemented with 5% fetal bovine serum. 290 mg/liter L-glutamic acid, 1xMEM non-essential amino acid 

is solution and 100 nM methotrexate was used. As a serum-free medium, the above medium without fetal bovine serum 
was used. EPO contained in each of the serum-free culture supernatants was quantitatively determined by ELISA using 
anti-human EPO antibody. 

(4) Analysis of EPOs Produced by GnT-IV Gene-Introduced or -Non-Introduced Clones Based on Their Sugar Chain 
20 Structures 

{0135] A recombinant EPO does not exist as a single molecule on isoelectric focusing gel; it is mixture of molecules 
with various electric charges. Since the protein moiety does not vary, it has been shown that the difference in electric 
charge among these molecules is based on the difference in sugar chain structure; such mixture of molecules is called 

25 glycoforms [Watson. E. and Yao, F, Anal. Biochem. (1993), 210, 389-93]. EPO has three Asn-linked sugar chains; the 
branching structures of individual sugar chains vary from biarrtennary to tetraantennary. Gal (galactose) is further 
attaching to the end of each of branched GlcNAc's, and sialic acid is further attaching to this Gal. Therefore, H the 
degree of sugar chain branching is increased by the introduction of GnT-IV gene, the number of sialic acid molecules 
attaching to Gal should increase and, thus, the content of glycoforms with low isoelectric point should increase. Then. 

30 the inventors performed analysis by isoelectric focusing to detect changes in the sugar chain structure of the EPOs pro- 
duced by GnT-IV gene-introduced EPO-producing cells. 

10136] For the isoelectric focusing, Multiphor II equipment manufactured by Pharmacia was used. The gel was com- 
posed of 5% acrylamide (30: 0.8) and 1 .5% Pharmalyte 2.5-5 (Pharmacia). As the (+) electrode solution, 0.1 M sulfuric 
acid was used. As the (-) electrode solution, 0.2 M L-histidine was used. After the isoelectric focusing, samples were 

35 electrophoretically transferred onto a PVDF membrane, followed by Western Wot analysis using anti-EPO mouse mon- 
oclonal antibody to detect the bands of individual glycoforms of the EPOs. Briefly, the serum-free culture supernatant 
of each cell clone was concentrated to about 7 to 1000-fold with Centriplus 30 and Microcon 30 (both manufactured by 
Amicon), if necessary. At the beginning, about 50-1 00 IU of EPO was used as a sample, but this amount was adjusted 
appropriately so that the intensities of bands detected by Western blot analysis would be almost equal between sam- 

40 pies. 

[0137] When the EPO from M01 clone was compared to the EPO from M01 (bovine GnT-IV)#36 clone, it was con- 
firmed that the positions of major glycoforms in the latter show a shift to the low pi side (+ electrode solution side) by at 
least one glycoform (Fig. 19). From this result it is thought that the GnT-IV enzyme expressed as a result of the gene 
introduction increased the number of GlcNAc branches in the Asn-linked sugar chains attaching to EPO, thus increas- 
es ing the number of sialic acid molecules attaching to increase the content of glycoforms with low isoelectric points. A sim- 
ilar analysis was performed on H-5 clone and H-5(human GnT-IV) #23 clone. As a result, it was also found that the 
positions of major EPO glycoforms in the latter shift to the low pi side (Fig. 19). 

[0138] From the above, it was demonstrated that it is possible to modify the structure of Ans-linked sugar chains of 
the protein produced by the cell by introducing a GnT-IV gene into any cell. 

50 

Industrial Applicability 

[0139] According to the present invention, a novel 01 -> 4 N-acetylglucosaminyltransferase (GnT-IV), a method for 
producing the GnT-IV enzyme and a gene coding for the GnT-IV are provided. With the GnT-IV of the present invention, 
55 it has become possible to produce a glycocoujugate having a branching structure which could not be formed with con- 
ventional glycosyltransferases. Thus, the GnT-IV of the invention is useful not only for producing or improving glycocon- 
jugate type pharmaceuticals, reagents and foods, but also for modifying the sugar chain structure of any biopolymer. 
[0140] The GnT-IV gene of the invention is also useful for diagnosing or treating diseases such as cancer and for mod- 
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ifying the sugar chain structure of glycoconjugate products produced by microorganisms. 

[0141] Further, an antibody or anti-serum raised against the GnT-IV protein of the invention as an antigen, or a part 
or all of the GnT-IV gene of the invention as a probe is useful for characterizing microorganisms, cultured cells, various 
animal tissues, blood cells and blood or for diagnosing diseased cells or tissues such as cancer. 

5 

SEQUENCE LISTING 

SEQ ID NO: 1 
J0 SEQUENCE LENGTH: 8 

SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 
*5 MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
20 Asp Asn Leu Tyr Pro Glu Glu Lys 

5 



SEQ ID NO: 2 
SEQUENCE LENGTH: 11 
SEQUENCE TYPE: amino acid 

30 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 

35 

SEQUENCE DESCRIPTION 

Asp Tyr Val Asn Gly Val Val Ala Asn Glu Lys 
40 5 10 



SEQ ID NO: 3 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



55 



SEQUENCE DESCRIPTION 

Glu He Ser Ser Gly Leu Val Glu lie lie Ser Pro Pro Glu Ser Tyr 
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5 10 15 

5 Tyr Pro Asp Leu Thr 

20 

10 SEQ ID NO: 4 

SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 

15 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 

20 

SEQUENCE DESCRIPTION 

Glu Arg Val Arg Trp Arg Thr Lys 

25 5 

SEQ ID NO: 5 
30 SEQUENCE LENGTH: 15 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 

35 

MOLECULE TYPE: peptide 

^ SEQUENCE DESCRIPTION 

Lys Gin Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gin Glu 
5 10 15 

45 

SEQ ID NO: 6 
SEQUENCE LENGTH: 6 
50 SEQUENCE TYPE: amino acid 

TOPOLOGY : linear 
MOLECULE TYPE: peptide 



55 
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10 



20 



SEQUENCE DESCRIPTION 
Asp His He Leu Trp Val 
5 



SEQ ID NO: 7 
SEQUENCE LENGTH : 14 
SEQUENCE TYPE : amino acid 
15 TOPOLOGY: linear 

MOLECULE TYPE: peptide 



25 



SEQUENCE DESCRIPTION 

Lys lie His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu 
5 10 



SEQ ID NO: 8 
30 SEQUENCE LENGTH: 10 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 

35 

MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 

Lys Val Tyr Gin Gly His Thr Leu Glu Lys 
5 10 

SEQ ID NO: 9 
SEQUENCE LENGTH: 10 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 



55 
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MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION 

Asp Phe Phe Trp Ala lie Thr Pro Val Ala 
5 10 

SEQ ID NO: 10 
SEQUENCE LENGTH: 6 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
Asp Tyr lie Leu Phe Lys 
5 

SEQ ID NO: 11 
SEQUENCE LENGTH: 15 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Asp Lys Pro Val Asn Val Glu Sec Tyr Leu Phe His Ser Gly Asn 
5 10 

SEQ ID NO: 12 
SEQUENCE LENGTH: 10 
SEQUENCE TYPE: amino acid 
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TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Asp He Leu Leu X Thr Thr Val Glu Val 
5 10 

SEQ ID NO: 13 
SEQUENCE LENGTH: 9 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE i peptide 

SEQUENCE DESCRIPTION 
Lys Ser Glu Gly Leu Asp He Ser Lys 
5 

SEQ ID NO: 14 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 
Asp Gly Tyr Phe Arg He Gly Lys 
5 

SEQ ID NO: 15 
SEQUENCE LENGTH: 29 
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SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

AAR ATY CAY GTB AAY CCH CCH GCN GAR GT 29 
Lys lie His Val Asn Pro Pro Ala Glu Val 

SEQ ID NO: 16 

SEQUENCE LENGTH: 35 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

TG RAA VAR RTA RSW YTC VAC RTT VAC DGG YTT RTC 
His Phe Leu Tyr Ser Glu Val Asn Val Pro Lys Asp 

SEQ ID NO: 17 
SEQUENCE LENGTH: 2246 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA 

SEQUENCE DESCRIPTION 
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w 



15 



GGCGGCTGCT CGGTGGCGGC TCGTCGGCGG CCGCGGCAGG ACTGGCAGCG CCGGGGGCGG 60 
GGAGAAAGAA GCATCCACCT ATGAAGACCG TGCAGACAGT CCTGAATAAT AATTGTGAAT 120 
GGTGTGGCTG CCAGACTAGT TCTGCTGAGC ATCTGAAATG AACCTCTCCT ATTGATTGTT 180 
TCAGTTGGCC CCGAGCCAGG AGTACTGGGT TTGCTTGACT TCAGGATAAA AAGAAACGGA 240 
CTTGGTTATC ATCGTAAACA TATGAACCAG TGTGATGGTG AAATGAG ATG AGG CTC 296 

Met Arg Leu 
1 

CGA AAT GGA ACT GTA GCC ACT GTT TTA GCA TTT ATC ACC TCG TTC CTC 344 
Arg Asn Gly Thr Val Ala Thr Val Leu Ala Phe He Thr Ser Phe Leu 

i 

5 10 15 

SO ACT TTA TCT TGG TAT ACA ACA TGG CAA AAT GGG AAA GAA AAA GTG ATT 392 

Thr Leu Ser Trp Tyr Thr Thr Trp Gin Asn Gly Lys Glu Lys Val He 
20 25 30 35 

25 GCT TAT CAA CGA GAA TTT CTT GCT CTG AAA GAA CGT CTC CGA ATA GCT 440 

Ala Tyr Gin Arg Glu Phe Leu Ala Leu Lys Glu Arg Leu Arg He Ala 

40 45 50 

GAA CAT CGA ATC TCT CAG CGC TCT TCT GAG CTC AGT GCC ATT GTA CAG 488 
Glu His Arg He Ser Gin Arg Ser Ser Glu Leu Ser Ala He Val Gin 

55 60 65 

CAA TTC AAG CGT GTA GAA GCA GAA ACA AAC AGG AGT AAG GAT CCA GTG 536 
Gin Phe Lys Arg Val Glu Ala Glu Thr Asn Arg Ser Lys Asp Pro Val 

70 75 80 

AAT AAA TTT TCA GAT GAT ACC CTA AAG ATA CTA AAG GAG TTA ACA AGC 584 
Asn Lys Phe Ser Asp Asp Thr Leu Lys lie Leu Lys Glu Leu Thr Ser 
45 85 90 95 

AAA AAG TCT CTT CAA GTG CCA AGT ATT TAT TAT CAT TTG CCT CAT TTA 632 
Lys Lys Ser Leu Gin Val Pro Ser He Tyr Tyr His Leu Pro His Leu 
so 10Q 105 110 H5 

TTG CAA AAT GAA GGA AGC CTT CAA CCT GCC GTG CAG ATC GGA AAT GGA 680 



30 



35 



to 



55 
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TO 



15 



20 



25 



30 



35 



40 



45 



Leu Gin Asn 

CGA ACA GGA 
Arg Thr Gly 

GTT AAA TCT 
Val Lys Ser 
150 

TAT CCT GAA 
Tyr Pro Glu 

165 
ACA GAT ACT 
Thr Asp Thr 
180 

TTT TCT AAA 
Phe Ser Lys 

GAA AGC TAT 
Glu Ser Tyr 



so 



TCT AAA GAA 
Ser Lys Glu 
230 

TTT CTA ATG 
Phe Leu Met 

245 
GAA GAT GAT 
Glu Asp Asp 
260 



Glu Gly Ser Leu Gin Pro Ala Val Gin He Gly Asn Gly 

120 125 130 

GTT TCA ATA GTA ATG GGA ATT CCT ACA GTG AAG AGA GAA 728 
Val Ser He Val Met Gly He Pro Thr Val Lys Arg Glu 
135 140 145 

TAC CTC ATA GAA ACT CTT CAT TCC CTT ATT GAT AAT CTG 776 
Tyr Leu He Glu Thr Leu His Ser Leu He Asp Asn Leu 

155 160 
GAG AAG TTG GAC TGT GTT ATA GTA GTC TTC ATA GGA GAG 824 
Glu Lys Leu Asp Cys Val lie Val Val Phe He Gly Glu 

170 175 
GAT TAT GTA AAT GGT GTT GTA GCC AAC CTG GAG AAA GAA 872 
Asp Tyr Val Asn Gly Val Val Ala Asn Leu Glu Lys Glu 
185 190 195 

GAA ATC AGT TCT GGC TTG GTG GAA ATA ATA TCA CCT CCT 920 
Glu He Ser Ser Gly Leu Val Glu He He Ser Pro Pro 

200 205 210 

TAT CCT GAC CTG ACG AAC TTA AAG GAG ACA TTT GGA GAT 968 
Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr Phe Gly Asp 
215 220 225 

AGA GTA AGA TGG AGA ACA AAG CAA AAC CTA GAT TAT TGT 1016 
Arg Val Arg Trp Arg Thr Lys Gin Asn Leu Asp Tyr Cys 

235 240 
ATG TAT GCT CAG GAA AAA GGC ACA TAC TAC ATC CAG CTT 1064 
Met Tyr Ala Gin Glu Lys Gly Thr Tyr Tyr He Gin Leu 

250 255 
ATT ATT GTC AAA CAG AAT TAC TTT AAC- ACC ATA AAG AAT 1112 
He He Val Lys Gin Asn Tyr Phe Asn Thr He Lys Asn 
265 270 275 
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10 



15 



25 



30 



35 



40 



45 



TTT GCA CTT CAA CTT TCT TCT GAG GAA TGG ATG ATA CTT GAG TTC TCC 1160 
Phe Ala Leu Gin Leu Ser Ser Glu Glu Trp Met lie Leu Glu Phe Ser 

280 285 290 

CAG CTG GGA TTC ATT GGT AAA ATG TTT CAA GCA CCT GAC CCA CTC CTG 1208 
Gin Leu Gly Phe lie Gly Lys Met Phe Gin Ala Pro Asp Leu Thr Leu 

295 300 305 

ATT GTG GAA TTC ATA TTT ATG TTC TAT AAG GAG AAG CCC ATC GAC TGG 1256 
He Val Glu Phe He Phe Met Phe Tyr Lys Glu Lys Pro He Asp Trp 
310 315 320 

i 

CTC TTG GAC CAT ATT CTG TGG GTC AAA GTC TGC AAC CCG GAA AAA GAT 1304 
20 Leu Leu Asp His He Leu Trp Val Lys Val Cys Asn Pro Glu Lys Asp 

325 330 335 

GCA AAA CAC TGT GAT CGA CAG AAG GCA AAT CTG CGA ATT CGT TTC AGA 1352 
Ala Lys His Cys Asp Arg Gin Lys Ala Asn Leu Arg lie Arg Phe Arg 
340 345 350 355 

CCG TCC CTT TTC CAA CAC GTT GGT CTG CAT TCT TCA CTC ACA GGA AAA 1400 
Pro Ser Leu Phe Gin His Val Gly Leu His Ser Ser Leu Thr Gly Lys 

360 365 370 

ATT CAG AAA CTC ACG GAT AAA GAT TAC ATG AAA CCA TTA CTG CTC AAA 1448 
lie Gin Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu Leu Leu Lys 

375 380 385 

ATC CAT GTA AAC CCC CCT GCA GAG GTA TCT ACT TCT TTG AAG GTC TAC 1496 
He His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu Lys Val Tyr 

390 395 400 

CAA GGT CAT ACA CTG GAG AAA ACT TAC ATG GGT GAG GAC TTC TTC TGG 1544 
Gin Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp Phe Phe Trp 
405 - 410 415 

50 GCT ATA ACC CCA GTA GCT GGA GAC TAC ATC CTA TTT AAA TTC GAC AAG 1592 

Ala He Thr Pro Val Ala Gly Asp Tyr He Leu Phe Lys Phe Asp Lys 
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420 425 430 435 

CCA GTC A AT GTG GAA AGT TAT TTG TTC CAT AGT GGC AAC CAG GAT CAT 1640 
Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn Gin Asp His 

440 445 450 

CCA GGG GAT ATT CTG CTC AAC ACA ACG GTG GAA GTT CTG CCT TTG AAG 1688 
Pro Gly Asp lie Leu Leu Asn Thr Thr Val Glu Val Leu Pro Leu Lys ' 

455 460 465 

AGT GAA GGT TTG GAC ATC AGC AAA GAA ACC AAA GAC AAA CGA TTA GAA 1736 
Ser Glu Gly Leu Asp He Ser Lys Glu Thr Lys Asp Lys Arg Leu Glu 

470 475 480 

GAT GGC TAT TTC AGA ATA GGG AAA TTT GAA AAC GGT GTT GCG GAA GGG 1784 
Asp Gly Tyr Phe Arg He Gly Lys Phe Glu Asn Gly Val Ala Glu Gly 

485 490 495 

ATG GTG GAT CCC AGC CTA AAC CCC ATT TCG GCC TTC CGA CTT TCA GTT 1832 
Met Val Asp Pro Ser Leu Asn Pro He Ser Ala Phe Arg Leu Ser Val 500 
500 505 510 515 

ATT CAG AAT TCT GCT GTT TGG GCC ATT CTT AAT GAG ATC CAT ATT AAA 1880 
He Gin Asn Ser Ala Val Trp Ala He Leu Asn Glu He His He Lys 

520 525 530 

AAA GTC ACA AAC TGACCATC 1900 
Lys Val Thr Asn 
535 

TCTACTAAGA AACCAACACA TTTTTTCCCT GTGAATTTGT TGATTAAAGA CAGCTGAGCA 1960 
CGTACCTTTT TTTGGTAACT TGAATTCTAC CTCTCGCGAA ATCTACTGTA GATAAAATGA 2020 
TTGTCATATT TCCACTTGGA AAATGAATCT CCCACGGATA ATTGTATTCA TTTGAATCTA 2080 
AGCTGTCCTC CAGTTTTAAC TCAACTCAAA CGTTTTACAG TTATGACAGC CTGTTAATAT 2140 
GACTTGTACT ATTTTGGTAT TATACTAATA CATAAGAGTT GTACATATTG TTACATTCAT 2200 
TAAATTTGAG AAAAATTAAT GTTAAATACA TTTTATGAAC GGGCCG 2246 
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SEQ ID NO: 18 
SEQUENCE LENGTH: 535 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 

Met Arg Leu Arg Asn Gly Thr Val Ala Thr Val Leu Ala Phe He Thr 

15 10 15 

Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gin Asn Gly Lys Glu 

20 25 30 

Lys Val He Ala Tyr Gin Arg Glu Phe Leu Ala Leu Lys Glu Arg Leu 

35 40 45 

Arg He Ala Glu His Arg He Ser Gin Arg Ser Ser Glu Leu Ser Ala 

50 55 60 

He Val Gin Gin Phe Lys Arg Val Glu Ala Glu Thr Asn Arg Ser Lys 
65 70 75 80 

Asp Pro Val Asn Lys Phe Ser Asp Asp Thr Leu Lys He Leu Lys Glu 

85 90 95 

Leu Thr Ser Lys Lys Ser Leu Gin Val Pro Ser He Tyr Tyr His Leu 

100 105 HO 

Pro His Leu Leu Gin Asn Glu Gly Ser Leu Gin Pro Ala Val Gin He 

115 120 125 

Gly Asn Gly Arg Thr Gly Val Ser He Val Met Gly He Pro Thr Val 

130 135 140 

Lys Arg Glu Val Lys Ser Tyr Leu He Glu Thr Leu His Ser Leu He 
145 - 150 155 160 

Asp Asn Leu Tyr Pro Glu Glu Lys Leu Asp Cys Val He Val Val Phe 
165 170 175 



EP 0 905 232 A1 



He Gly Glu Thr Asp Thr Asp Tyr Val Asn Gly Val Val Ala Asn Leu 

180 185 190 

Glu Lys Glu Phe Ser Lys Glu He Ser Ser Gly Leu Val Glu He He 

195 200 205 

Ser Pro Pro Glu Ser Tyr Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr 

210 215 220 

Phe Gly Asp Ser Lys Glu Arg Val Arg Trp Arg Thr Lys Gin Asn Leu 
225 230 235 240 

Asp Tyr Cys Phe Leu Met Met Tyr Ala Gin Glu Lys Gly Thr Tyr Tyr 

245 250 255 

lie Gin Leu Glu Asp Asp He He Val Lys Gin Asn Tyr Phe Asn Thr 

260 265 270 

He Lys Asn Phe Ala Leu Gin Leu Ser Ser Glu Glu Trp Met He Leu 

275 280 285 

Glu Phe Ser Gin Leu Gly Phe lie Gly Lys Met Phe Gin Ala Pro Asp 

290 295 300 

Leu Thr Leu lie Val Glu Phe lie Phe Met Phe Tyr Lys Glu Lys Pro 
305 310 315 320 

He Asp Trp Leu Leu Asp His He Leu Trp Val Lys Val Cys Asn Pro 

325 330 335 

Glu Lys Asp Ala Lys His Cys Asp Arg Gin Lys Ala Asn Leu Arg He 

340 345 350 

Arg Phe Arg Pro Ser Leu Phe Gin His Val Gly Leu His Ser Ser Leu 

355 360 365 

Thr Gly Lys lie Gin Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu 

370 375 380 

Leu Leu Lys lie His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu 
385 390 395 400 

Lys Val Tyr Gin Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp 
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405 410 415 

Phe Phe Trp Ala He Thr Pro Val Ala Gly Asp Tyr He Leu Phe Lys 

420 425 430 

Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn 

435 440 445 

Gin Asp His Pro Gly Asp He Leu Leu Asn Thr Thr Val Glu Val Leu 

450 455 460 

Pro Leu Lys Ser Glu Gly Leu Asp He Ser Lys Glu Thr Lys Asp Lys 
465 470 475 480 

Arg Leu Glu Asp Gly Tyr Phe Arg He Gly Lys Phe Glu Asn Gly Val 

485 490 495 

Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro He Ser Ala Phe Arg 

500 505 510 

Leu Ser Val He Gin Asn Ser Ala Val Trp Ala He Leu Asn Glu He 

515 520 525 

His He Lys Lys Val Thr Asn 
530 535 



SEQ ID NO: 19 

SEQUENCE LENGTH: 31 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



SEQUENCE DESCRIPTION 
CCCTCGAG ATG AGG CTC 
Met Arg Leu 



CGA AAT GGA ACT GT 31 
Arg Asn Gly Thr Val 
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SEQ ID NO: 20 

SEQUENCE LENGTH: 31 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTI SENSE: Yes 

SEQUENCE DESCRIPTION 

TTTCTAGA TCA GTT TGT GAC TTT TTT AAT AT 
TRM Asn Thr Val Lys Lys He His 

SEQ ID NO: 21 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACGATTGTGC AACAGTTCAA GCGT 24 

SEQ ID NO: 22 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 
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SEQUENCE DESCRIPTION 
GGGAGAACTC CAGGATCATC CAGT 24 



SEQ ID NO: 23 
SEQUENCE LENGTH: 2115 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA 



SEQUENCE DESCRIPTION 

GAAATGAACC TCTCTTATTG ATTTTTATTG GCCTAGAGCC AGGAGTACTG CATTCAGTTG 60 
ACTTTCAGGG TAAAAAGAAA ACAGTCCTGG TTGTTGTCAT CATAAACATA TGGACCAGTG 120 
TGATGGTGAA ATGAG ATG AGG CTC CGC AAT GGA ACT GTA GCC ACT GCT TTA 171 
Met Arg Leu Arg Asn Gly Thr Val Ala Thr Ala Leu 
15 10 
GCA TTT ATC ACT TCC TTC CTT ACT TTG TCT TGG TAT ACT ACA TGG CAA 219 
Ala Phe He Thr Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gin 

15 20 25 

AAT GGG AAA GAA AAA CTG ATT GCT TAT CAA CGA GAA TTC CTT GCT TTG 267 
Asn Gly Lys Glu Lys Leu He Ala Tyr Gin Arg Glu Phe Leu Ala Leu 

30 35 40 

AAA GAA CGT CTT CGA ATA GCT GAA CAC AGA ATC TCA CAG CGC TCT TCT 315 
Lys Glu Arg Leu Arg He Ala Glu His Arg He Ser Gin Arg Ser Ser 
45 50 55 . 60 

GAA TTA AAT ACG ATT GTG CAA CAG TTC AAG CGT GTA GGA GCA GAA ACA 363 
Glu Leu Asn Thr He Val Gin Gin Phe Lys Arg Val Gly Ala Glu Thr 
65 70 75 
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AAT GGA 
Asn Gly 

CTG TTA 
Leu Leu 

TAT TAT 
Tyr Tyr 
110 
GCT GTA 
Ala Val 
125 

ATT CCC 
lie Pro 



AGT AAG 
Ser Lys 
80 

AAG GAG 
Lys Glu 

95 
CAT TTG 
His Leu 

CAG ATT 
Gin He 

ACA GTG 
Thr Val 



GAT GCG TTG AAT 
Asp Ala Leu Asn 



CAT TCC 
His Ser 

ATA GTA 
He Val 

GTA GCC 
Val Ala 
190 
GTG GAA 
Val Glu 
205 

CTA AAG 
Leu Lys 



CTT ATT 
Leu He 
160 
GTC TTC 
Val Phe 
175 

AAC CTG 
Asn Leu 

GTC ATA 
Val He 

GAG ACA 
Glu Thr 



TTA ACA 
Leu Thr 

CCT CAT 
Pro His 

GGC AAC 
Gly Asn 
130 
AAG AGA 
Lys Arg 
145 

GAT AAC 
Asp Asn 



AGC AAA 
Ser Lys 
100 
TTA TTG 
Leu Leu 
115 

GGA AGA 
Gly Arg 

GAA GTT 
Glu Val 

CTG TAT 
Leu Tyr 



ATA GGA 
He Gly 

GAG AAA 
Glu Lys 

TCA CCC 
Ser Pro 
210 
TTT GGA 
Phe Gly 



GAG ACA 
Glu Thr 
180 
GAA TTT 
Glu Phe 
195 

CCT GAA 
Pro Glu 



GAC TCC 
Asp Ser 



AAG TTT TCA GAT AAT 
Lys Phe Ser Asp Asn 
85 

AAA TCT CTT CAA GTG 
Lys Ser Leu Gin Val 
105 

AAA AAT GAA GGA AGT 
Lys Asn Glu Gly Ser 
120 

ACA GGA GTT TCA ATA 
Thr Gly Val Ser He 
135 

AAA TCT TAC CTC ATA 
Lys Ser Tyr Leu He 
150 

CCT GAA GAG AAG TTG 
Pro Glu Glu Lys Leu 
165 

GAT ATT GAT TAT GTA 
Asp He Asp Tyr Val 
185 

TCT AAA GAA ATC AGT 
Ser Lys Glu He Ser 
200 

AGC TAT TAT CCT GAC 
Ser Tyr Tyr Pro Asp 
215 

AAA GAA AGA GTA AGA 
Lys Glu Arg Val Arg 



ACC CTA AAG 411 
Thr Leu Lys 
90 

CCA AGT ATT 459 
Pro Ser He 



CTT CAA CCT 507 
Leu Gin Pro 

i 

GTC ATG GGC 555 
Val Met Gly 
140 

GAA ACT CTT 603 
Glu Thr Leu 
155 

GAC TGT GTT 651 
Asp Cys Val 
170 

CAT GGT GTT 699 
His Gly Val 

TCT GGC TTG 747 
Ser Gly Leu 

TTG ACA AAC 795 
Leu Thr Asn 
220 

TGG AGA ACA 843 
Trp Arg Thr 
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w 



15 



20 



25 



30 



35 



40 



45 



50 



AAG CAA AAC 
Lys Gin Asn 

GGC ATA TAT 
Gly He Tyr 
255 

TAT TTT AAT 
Tyr Phe Asn 

270 
TGG ATG ATT 
Trp Met He 
285 

CAA GCG CCG 
Gin Ala Pro 

AAG GAG AAA 
Lys Glu Lys 



GTC TGC AAC 
Val Cys Asn 
335 

AAT CTG CGA 
Asn Leu Arg 

350 
CAC TCA TCA 

His Ser Ser 
365 

ATG AAA CCA 



225 230 235 

CTA GAT TAC TGT TTT CTA ATG ATG TAT GCT CAA GAA AAG 891 
Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gin Glu Lys 
240 245 250 

TAC ATT CAG CTT GAA GAT GAT ATT ATT GTC AAA CAA AAT 939 
Tyr He Gin Leu Glu Asp Asp He He Val Lys Gin Asn 

260 265 
ACC ATA AAA AAT TTT GCA CTT CAA CTT TCT TCT GAG GAA 987 
Thr He Lys Asn Phe Ala Leu Gin Leu Ser Ser Glu Glu 

i 

275 280 
CTA GAG TTT TCC CAG CTG GGC TTC ATT GGT AAA ATG TTT 1035 
Leu Glu Phe Ser Gin Leu Gly Phe He Gly Lys Met Phe 
290 295 300 

GAT CTT ACT CTG ATT GTA GAA TTC ATA TTC ATG TTT TAC 1083 
Asp Leu Thr Leu He Val Glu Phe He Phe Met Phe Tyr 

305 310 315 

CCC ATT GAT TGG CTC CTG GAC CAT ATT CTC TGG GTG AAA 1131 
Pro He Asp Trp Leu Leu Asp His He Leu Trp Val Lys 
320 325 330 

CCT GAA AAA GAT GCA AAA CAT TGT GAT AGA CAG AAA GCA 1179 
Pro Glu Lys Asp Ala Lys His Cys Asp Arg Gin Lys Ala 

340 345 
ATT CGC TTC AGA CCT TCC CTT TTC CAA CAT GTT GGT CTG 1227 
He Arg Phe Arg Pro Ser Leu Phe Gin His Val Gly Leu 

355 360 
CTA TCA GGA AAA ATC CAA AAA CTC ACG GAT AAA GAT TAT 1275 
Leu Ser Gly Lys He Gin Lys Leu Thr Asp Lys Asp Tyr 
370 375 380 

TTA CTT CTT AAA ATC CAT GTA AAC CCA CCT GCG GAG GTA 1323 
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70 



75 



Met Lys Pro Leu Leu Leu Lys lie His Val Asn Pro Pro Ala Glu Val 

385 390 395 

TCT ACT TCC TTG AAG GTC TAC CAA GGG CAT ACG CTG GAG AAA ACT TAC 1371 
Ser Thr Ser Leu Lys Val Tyr Gin Gly His Thr Leu Glu Lys Thr Tyr 

400 405 410 

ATG GGA GAG GAT TTC TTC TGG GCT ATC ACA CCG ATA GCT GGA GAC TAC 1419 
Met Gly Glu Asp Phe Phe Trp Ala He Thr Pro He Ala Gly Asp Tyr 

415 420 425 

ATC TTG TTT AAA TTT GAT AAA CCA GTC AAT GTA GAA AGT TAT TTG TTC 1467 
He Leu Phe Lys Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe 
20 430 435 440 

CAT AGC GGC AAC CAA GAA CAT CCT GGA GAT ATT CTG CTA AAC ACA ACT 1515 
His Ser Gly Asn Gin Glu His Pro Gly Asp He Leu Leu Asn Thr Thr 
25 445 450 455 460 

GTG GAA GTT TTG CCT TTT AAG AGT GAA GGT TTG GAA ATA AGC AAA GAA 1563 
Val Glu Val Leu Pro Phe Lys Ser Glu Gly Leu Glu He Ser Lys Glu 

465 470 475 

ACC AAA GAC AAA CGA TTA GAA GAT GGC TAT TTC AGA ATA GGA AAA TTT 1611 
Thr Lys Asp Lys Arg Leu Glu Asp Gly Tyr Phe Arg He Gly Lys Phe 

480 485 490 

GAG AAT GGT GTT GCA GAA GGA ATG GTG GAT CCA AGT CTC AAT CCC ATT 1659 
Glu Asn Gly Val Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro He 

495 500 505 

TCA GCC TTT CGA CTT TCA GTT ATT CAG AAT TCT GCT GTT TGG GCC ATT 1707 
45 Ser Ala Phe Arg Leu Ser Val He Gin Asn Ser Ala Val Trp Ala He 

510 515 520 

CTT AAT GAG ATT CAT ATT AAA AAA GCC ACC AAC TGATCATCTG AGAAACCAAC 1760 
Leu Asn Glu He His He Lys Lys Ala Thr Asn 
525 530 535 



30 



35 



40 



50 
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ACATTTTTTC CTGTGAATTT GTTAATTAAA GATAGTTAAG CATGTATCTT TTTTTTATTT 1820 
CTACTTGAAC ACTACCTCTT GTGAAGTCTA CTGTAGATAA GACGATTGTC ATTTCCACTT 1880 
GGAAAGTGAA TCTCCCATAA TAATTGTATT TGTTTGAAAC TAAGCTGTCC TCAGATTTTA 1940 
ACTTGACTCA AACATTTTTC AATTATGACA GCCTGTTAAT ATGACTTGTA CTATTTTGGT 2000 
ATTATACTAA TACATAAGAG TTGTACATAT TGTTACATTC TTTAAATTTG AGAAAAACTA 2060 
ATGTTACATA CATTTTATGA AGGGGGTACT TTTGAGGTTC ACTTATTTTA CTATT 2115 



SEQ ID NO: 24 
SEQUENCE LENGTH: 535 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION 
Met Arg Leu Arg Asn Gly Thr Val 

1 5 
Ser Phe Leu Thr Leu Ser Trp Tyr 
20 

Lys Leu lie Ala Tyr Gin Arg Glu 
35 40 
Arg He Ala Glu His Arg He Ser 

50 55 
He Val Gin Gin Phe Lys Arg Val 
65 70 
Asp Ala Leu Asn Lys Phe Ser Asp 
85 

Leu Thr Ser Lys Lys Ser Leu Gin 
100 

Pro His Leu Leu Lys Asn Glu Gly 



Ala Thr Ala Leu Ala Phe He Thr 

10 15 
Thr Thr Trp Gin Asn Gly Lys Glu 

25 30 
Phe Leu Ala Leu Lys Glu Arg Leu 
45 

Gin Arg Ser Ser Glu Leu Asn Thr 
60 

Gly Ala Glu Thr Asn Gly Ser Lys 
75 SO 
Asn Thr Leu Lys Leu Leu Lys Glu 

90 95 
Val Pro Ser He Tyr Tyr His Leu 
105 110 
Ser Leu Gin Pro Ala Val Gin He 
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115 120 125 

Gly Asn Gly Arg Thr Gly Val Ser He Val Met Gly He Pro Thr Val 

130 135 140 

Lys Arg Glu Val Lys Ser Tyr Leu He Glu Thr Leu His Ser Leu He 
145 150 155 160 

Asp Asn Leu Tyr Pro Glu Glu Lys Leu Asp Cys Val He Val Val Phe 

165 170 175 

He Gly Glu Thr Asp He Asp Tyr Val His Gly Val Val Ala Asn Leu 

180 185 190 

Glu Lys Glu Phe Ser Lys Glu He Ser Ser Gly Leu Val Glu Val He 

!95 200 205 

Ser Pro Pro Glu Ser Tyr Tyr Pro Asp Leu Thr Asn Leu Lys Glu Thr 

210 215 220 

Phe Gly Asp Ser Lys Glu Arg Val Arg Trp Arg Thr Lys Gin Asn Leu 
225 230 235 240 

Asp Tyr Cys Phe Leu Met Met Tyr Ala Gin Glu Lys Gly He Tyr Tyr 

245 250 255 

He Gin Leu Glu Asp Asp He He Val Lys Gin Asn Tyr Phe Asn Thr 

260 265 270 

He Lys Asn Phe Ala Leu Gin Leu Ser Ser Glu Glu Trp Met He Leu 

275 280 285 

Glu Phe Ser Gin Leu Gly Phe He Gly Lys Met Phe Gin Ala Pro Asp 

290 295 300 

Leu Thr Leu He Val Glu Phe He Phe Met Phe Tyr Lys Glu Lys Pro 
305 310 315 320 

He Asp Trp Leu Leu Asp His lie Leu Trp Val Lys Val Cys Asn Pro 

325 330 335 

Glu Lys Asp Ala Lys His Cys Asp Arg Gin Lys Ala Asn Leu Arg He 
340 345 350 
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Arg Phe Arg Pro Ser Leu Phe Gin His Val Gly Leu His Ser Ser Leu 

355 360 365 

Ser Gly Lys He Gin Lys Leu Thr Asp Lys Asp Tyr Met Lys Pro Leu 

370 375 380 

Leu Leu Lys He His Val Asn Pro Pro Ala Glu Val Ser Thr Ser Leu 
385 390 395 400 

Lys Val Tyr Gin Gly His Thr Leu Glu Lys Thr Tyr Met Gly Glu Asp 

405 410 415 

Phe Phe Trp Ala He Thr Pro He Ala Gly Asp Tyr He Leu Phe Lys 

420 425 430 

Phe Asp Lys Pro Val Asn Val Glu Ser Tyr Leu Phe His Ser Gly Asn 

435 440 445 

Gin Glu His Pro Gly Asp He Leu Leu Asn Thr Thr Val Glu Val Leu 

450 455 460 

Pro Phe Lys Ser Glu Gly Leu Glu He Ser Lys Glu Thr Lys Asp Lys 
465 470 475 480 

Arg Leu Glu Asp Gly Tyr Phe Arg He Gly Lys Phe Glu Asn Gly Val 

485 490 495 

Ala Glu Gly Met Val Asp Pro Ser Leu Asn Pro He Ser Ala Phe Arg 

500 505 510 

Leu Ser Val He Gin Asn Ser Ala Val Trp Ala He Leu Asn Glu He 

51 5 520 525 

His He Lys Lys Ala Thr Asn 
530 535 

SEQ ID NO: 25 
SEQUENCE LENGTH.: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
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TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 26 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
AGAAATGTGG GCTTCAGGGC TGGC 24 

SEQ ID NO: 27 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 28 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

AGAAATGTGG GCTTCAGGGC TGGC 24 

SEQ ID NO: 29 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGAT GAGGCTCCGC AATGGAACTG 

SEQ ID NO: 30 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

AGAAATGTGG GCTTCAGGGC TGGC 24 

SEQ ID NO: 31 
SEQUENCE LENGTH: 25 
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SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TTCCATCACC TGCCACACCT GCTGG 25 

SEQ ID NO: 32 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACAACCCTCA GTCAGACAAG GAGG 24 

SEQ ID NO: 33 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ACACCCCCAG AAATGTGGGC TTCA 24 

SEQ ID NO: 34 
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10 



15 



30 



35 



40 



SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
ATGACCGAGT CCTCCTTCTC CTGC 24 



SEQ ID NO: 35 
20 SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 
25 TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



SEQUENCE DESCRIPTION 
ATGCCCATCA CCACCGACAC TCCG 24 



SEQ ID NO: 36 
SEQUENCE LENGTH: 1724 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
45 MOLECULE TYPE: cDNA 



SEQUENCE DESCRIPTION 
50 TGCAGCCTCG GCCCCGCGGG CGCCCGCCGC GCACCCGAGG AG ATG AGG CTC CGC 

Met Arg Leu Arg 
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JO 



15 



20 



25 



30 



35 



40 



45 



SO 



AAT GGC ACC TTC CTG ACG CTG CTG CTC TTC TGC CTG TGC GCC TTC CTC 102 
Asn Gly Thr Phe Leu Thr Leu Leu Leu Phe Cys Leu Cys Ala Phe Leu 
5 10 15 20 

TCG CTG TCC TGG TAC GCG GCA CTC AGC GGC CAG AAA GGC GAC GTT GTG 150 
Set Leu Ser Trp Tyr Ala Ala Leu Ser Gly Gin Lys Gly Asp Val Val 

25 30 35 

GAC GTT TAC CAG CGG GAG TTC CTG GCG CTG CGC GAT CGG TTG CAC GCA 198 
Asp Val Tyr Gin Arg Glu Phe Leu Ala Leu Arg Asp Arg Leu His Ala 

40 45 50 

GCT GAG CAG GAG AGC CTC AAG CGC TCC AAG GAG CTC AAC CTG GTG CTG 246 
Ala Glu Gin Glu Ser Leu Lys Arg Ser Lys Glu Leu Asn Leu Val Leu 

55 60 65 

GAC GAG ATC AAG AGG GCC GTG TCA GAA AGG CAG GCG CTG CGA GAC GGA 294 
Asp Glu He Lys Arg Ala Val Ser Glu Arg Gin Ala Leu Arg Asp Gly 

70 75 80 

GAC GGC AAT CGC ACC TGG GGC CGC CTA ACA GAG GAC CCC CGA TTG AAG 342 
Asp Gly Asn Arg Thr Trp Gly Arg Leu Thr Glu Asp Pro Arg Leu Lys 
85 90 95 100 

CCG TGG AAC GGC TCA CAC CGG CAC GTG CTG CAC CTG CCC ACC GTC TTC 390 
Pro Trp Asn Gly Ser His Arg His Val Leu His Leu Pro Thr Val Phe 

105 HO 115 

CAT CAC CTG CCA CAC CTG CTG GCC AAG GAG AGC AGT CTG CAG CCC GCG 438 
His His Leu Pro His Leu Leu Ala Lys Glu Ser Ser Leu Gin Pro Ala 

120 125 130 

GTG CGC GTG GGC CAG GGC CGC ACC GGA GTG TCG GTG GTG ATG GGC ATC 486 
Val Arg Val Gly Gin Gly Arg Thr Gly Val Ser Val Val Met Gly He 

13 5 140 145 

CCG AGC GTG CGG CGC GAG GTG CAC TCG TAC CTG ACT GAC ACT CTG CAC 534 
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Pro Ser Val Arg Arg Glu Val His Ser Tyr Leu Thr Asp Thr Leu His 

150 155 160 

TCG CTC ATC TCC GAG CTG AGC CCG CAG GAG AAG GAG GAC TCG GTC ATC 582 

Ser Leu He Ser Glu Leu Ser Pro Gin Glu Lys Glu Asp Ser Val He 

nn 175 I 80 

165 I 70 A,:> 

GTG GTG CTG ATC GCC GAG ACT GAC TCA CAG TAC ACT TCG GCA GTG ACA 630 
Val Val Leu lie Ala Glu Thr Asp Ser Gin Tyr Thr Ser Ala Val Thr 

185 190 195 

GAG AAC ATC AAG GCC TTG TTC CCC ACQ GAG ATC CAT TCT GGG CTC CTG 678 
Glu Asn He Lys Ala Leu Phe Pro Thr Glu He His Ser Gly Leu Leu 

200 205 210 

GAG GTC ATC TCA CCC TCC CCC CAC TTC TAC CCT GAC TTC TCC CGC CTC 726 
Glu Val He Ser Pro Ser Pro His Phe Tyr Pro Asp Phe Ser Arg Leu 

215 220 225 

CGA GAG TCC TTT GGG GAC CCC AAG GAG AGA GTC AGG TGG AGG ACC AAA 774 
Arg Glu Ser Phe Gly Asp Pro Lys Glu Arg Val Arg Trp Arg Thr Lys 

230 235 240 

CAG AAC CTC GAT TAC TGC TTC CTC ATG ATG TAC GCG CAG TCC AAA GGC 822 
Gin Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala Gin Ser Lys Gly 
245 250 255 260 

ATC TAC TAC GTG CAG CTG GAG GAT GAC ATC GTG GCC AAG CCC AAC TAC 870 
He Tyr Tyr Val Gin Leu Glu Asp Asp lie Val Ala Lys Pro Asn Tyr 

265 270 275 

CTG AGC ACC ATG AAG AAC TTT GCA CTG CAG CAG CCT TCA GAG GAC TGG 918 
Leu Ser Thr Met Lys Asn Phe Ala Leu Gin Gin Pro Ser Glu Asp Trp 

280 285 290 

ATG ATC CTG GAG TTC TCC CAG CTG GGC TTC ATT GGT AAG ATG TTC AAG 966 
Met He Leu Glu Phe Ser Gin Leu Gly Phe He Gly Lys Met Phe Lys 
295 300 305 
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TCG CTG GAC CTG AGC CTG ATT GTA GAG TTC ATT CTC ATG TTC TAC CGG 1014 
Ser Leu Asp Leu Ser Leu lie Val Glu Phe He Leu Met Phe Tyr Arg 

310 315 320 

GAC AAG CCC ATC GAC TGG CTC CTG GAC CAT ATT CTG TGG GTG AAA GTC 1062 
Asp Lys Pro lie Asp Trp Leu Leu Asp His lie Leu Trp Val Lys Val 
325 330 335 340 

TGC AAC CCC GAG AAG GAT GCG AAG CAC TGT GAC CGG CAG AAA GCC AAC 1110 
Cys Asn Pro Glu Lys Asp Ala Lys His Cys Asp Arg Gin Lys Ala Asn 

345 350 355 ^ 

CTG CGG ATC CGC TTC AAA CCG TCC CTC TTC CAG CAC GTG GGC ACT CAC 1158 
Leu Arg lie Arg Phe Lys Pro Ser Leu Phe Gin His Val Gly Thr His 

360 365 37° 

TCC TCG CTG GCT GGC AAG ATC CAG AAA CTG AAG GAC AAA GAC TTT GGA 1206 
25 ser Ser Leu Ala Gly Lys He Gin Lys Leu Lys Asp Lys Asp Phe Gly 

375 380 385 

AAG CAG GCG CTG CGG AAG GAG CAT GTG AAC CCG CCA GCA GAG GTG AGC 1254 
Lys Gin Ala Leu Arg Lys Glu His Val Asn Pro Pro Ala Glu Val Ser 

390 395 400 

ACG AGC CTG AAG ACA TAC CAG CAC TTC ACC CTG GAG AAA GCC TAC CTG 1302 
Thr Ser Leu Lys Thr Tyr Gin His Phe Thr Leu Glu Lys Ala Tyr Leu 
405 410 415 420 

CGC GAG GAC TTC TTC TGG GCC TTC ACC CCT GCC GCG GGG GAC TTC ATC 1350 
Arg Glu Asp Phe Phe Tr P >Ala Phe Thr Pro Ala Ala Gly Asp Phe lie 

425 430 435 

CGC TTC CGC TTC TTC CAA CCT CTA AGA CTG GAG CGG TTC TTC TTC CGC 1398 
Arg Phe Arg Phe Phe Gin Pro Leu Arg Leu Glu Arg Phe Phe Phe Arg 

440 445 450 

AGT GGG AAC ATC GAG CAC CCG GAG GAC AAG CTC TTC AAC ACG TCT GTG 1446 
Ser Gly Asn He Glu His Pro Glu Asp Lys Leu Phe Asn Thr Ser Val 
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455 460 465 

GAG GTG CTG CCC TTC GAC AAC CCT CAG TCA GAC AAG GAG GCC CTG CAG 1494 
Glu Val Leu Pro Phe Asp Asn Pro Gin Ser Asp Lys Glu Ala Leu Gin 

470 475 480 

GAG GGC CGC ACC GCC ACC CTC CGG TAC CCT CGG AGC CCC GAC GGC TAC 1542 
Glu Gly>Arg Thr Ala Thr Leu Arg Tyr Pro Arg Ser Pro Asp Gly Tyr 
485 490 495 500 

CTC CAG ATC GGC TCC TTC TAC AAG GGA GTG GCA GAG GGA GAG GTG GAC 1590 
Leu Gin He Gly Ser Phe Tyr Lys Gly Val Ala Glu Gly Glu Val Asp 

505 510 515 

CCA GCC TTC GGC CCT CTG GAA GCA CTG CGC CTC TCG ATC CAG ACG GAC 1638 
Pro Ala Phe Gly Pro Leu Glu Ala Leu Arg Leu Ser He Gin Thr Asp 

520 525 530 

TCC CCT GTG TGG GTG ATT CTG AGC GAG ATC TTC CTG AAA AAG GCC GAC 1686 
Ser Pro Val Trp Val lie Leu Ser Glu He Phe Leu Lys Lys Ala Asp 
535 540 545 548 

TAAGCTGCGG GCTTCTGAGG GTACCCTGTG GCCAGCCC 1724 



SEQ ID NO: 37 
SEQUENCE LENGTH: 548 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION 

Met Arg Leu Arg Asn Gly Thr Phe 

1 .5 
Cys Ala Phe Leu Ser Leu Ser Trp 
20 



Leu Thr Leu Leu Leu Phe Cys Leu 

10 15 
Tyr Ala Ala Leu Ser Gly Gin Lys 
25 30 
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Gly Asp Val Val Asp Val Tyr Gin Arg Glu Phe Leu Ala Leu Arg Asp 

35 40 45 

Arg Leu His Ala Ala Glu Gin Glu Ser Leu Lys Arg Ser Lys Glu Leu 

50 55 60 

Asn Leu Val Leu Asp Glu lie Lys Arg Ala Val Ser Glu Arg Gin Ala 
65 70 75 80 

Leu Arg Asp Gly Asp Gly Asn Arg Thr Trp Gly Arg Leu Thr Glu Asp 

85 90 95 

Pro Arg Leu Lys Pro Trp Asn Gly Ser His Arg His Val Leu His Leu 

100 105 HO 

Pro Thr Val Phe His His Leu Pro His Leu Leu Ala Lys Glu Ser Ser 

115 120 125 

Leu Gin Pro Ala Val Arg Val Gly Gin Gly Arg Thr Gly Val Ser Val 

130 135 140 

Val Met Gly He Pro Ser Val Arg Arg Glu Val His Ser Tyr Leu Thr 
145 150 155 160 

Asp Thr Leu His Ser Leu He Ser Glu Leu Ser Pro Gin Glu Lys Glu 

165 170 175 

Asp Ser Val He Val Val Leu He Ala Glu Thr Asp Ser Gin Tyr Thr 

180 185 190 

Ser Ala Val Thr Glu Asn He Lys Ala Leu Phe Pro Thr Glu He His 

195 200 205 

Ser Gly Leu Leu Glu Val He Ser Pro Ser Pro His Phe Tyr Pro Asp 

210 215 220 

Phe Ser Arg Leu Arg Glu Ser Phe Gly Asp Pro Lys Glu Arg Val Arg 
225 230 235 240 

Trp Arg Thr Lys Gin Asn Leu Asp Tyr Cys Phe Leu Met Met Tyr Ala 

245 250 255 

Gin Ser Lys Gly He Tyr Tyr Val Gin Leu Glu Asp Asp He Val Ala 
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260 



265 



270 



Lys Pro Asn Tyr Leu Ser Thr Met Lys Asn Phe Ala Leu Gin Gin Pro 

275 280 2 85 

Ser Glu Asp Trp Met He Leu Glu Phe Ser Gin Leu Gly Phe He Gly 

290 295 300 

Lys Met Phe Lys Ser Leu Asp Leu Ser Leu He Val Glu Phe He Leu 
305 310 315 320 

Met Phe Tyr Arg Asp Lys Pro He Asp Trp Leu Leu Asp His He Leu 

325 330 335 

Trp Val Lys Val Cys Asn Pro Glu Lys Asp Ala Lys His Cys Asp Arg 

340 345 350 

Gin Lys Ala Asn Leu Arg He Arg Phe Lys Pro Ser Leu Phe Gin His 

355 360 365 

Val Gly Thr His Ser Ser Leu Ala Gly Lys He Gin Lys Leu Lys Asp 

370 375 380 

Lys Asp Phe Gly Lys Gin Ala Leu Arg Lys Glu His Val Asn Pro Pro 
385 390 395 400 

Ala Glu Val Ser Thr Ser Leu Lys Thr Tyr Gin His Phe Thr Leu Glu 

405 410 415 

Lys Ala Tyr Leu Arg Glu Asp Phe Phe Trp Ala Phe Thr Pro Ala Ala 

420 425 430 

Gly Asp Phe He Arg Phe Arg Phe Phe Gin Pro Leu Arg Leu Glu Arg 

435 440 445 

Phe Phe Phe Arg Ser Gly Asn He Glu His Pro Glu Asp Lys Leu Phe 

450 455 460 

Asn Thr Ser Val Glu Val Leu Pro Phe Asp Asn Pro Gin Ser Asp Lys 
465 - 470 475 480 

Glu Ala Leu Gin Glu Gly Arg Thr Ala Thr Leu Arg Tyr Pro Arg Ser 
485 490 495 
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Pro Asp Gly Tyr Leu Gin lie Gly Ser Phe Tyr Lys Gly Val Ala Glu 

500 505 510 

Gly Glu Val Asp Pro Ala Phe Gly Pro Leu Glu Ala Leu Arg Leu Ser 

5X5 520 525 

He Gin Thr Asp Ser Pro Val Trp Val He Leu Ser Glu He Phe Leu 

530 535 540 

Lys Lys Ala Asp 
545 548 



SEQ ID NO: 38 

SEQUENCE LENGTH: 29 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TTCTCGAGGA GATGAGGCTC CGCAATGGC 29 



SEQ ID NO: 39 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



SEQUENCE DESCRIPTION 

AATCTAGAAA TGTGGGCTTC AGGGCTGGC 30 - 
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SEQ ID NO: 40 

SEQUENCE LENGTH: 28 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CCCTCGAG ATG GGG GTG CAC GAA TGT CC 28 
Met Gly Val His Glu Cys Pro 

SEQ ID NO: 41 

SEQUENCE LENGTH: 48 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CTTTTTGCTT GTTAACTCCT TTAGTATTGG GGCGCCCAGG ACTGGGAG 

SEQ ID NO: 42 

SEQUENCE LENGTH: 48 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : s i ng 1 e 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 
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SEQUENCE DESCRIPTION 

CTCCCAGTCC TGGGCGCCCC AATACTAAAG GAGTTAACAA GCAAAAAG 

SEQ ID NO: 43 

SEQUENCE LENGTH: 44 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CTTCCTTCAT TTTGCAATAA ATGAGGGGCG CCCAGGACTG GGAG 44 

SEQ ID NO: 44 

SEQUENCE LENGTH: 43 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

ATTTAACTTC TCTCTTCACT GTAGGGGCGC CCAGGACTGG GAG 43 

SEQ ID NO: 45 
SEQUENCE LENGTH: 44 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CTCCCAGTCC TGGGCGCCCC TCATTTATTG CAAAATGAAG GAAG 

SEQ ID NO: 46 

SEQUENCE LENGTH: 43 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

CTCCCAGTCC TGGGCGCCCC TACAGTGAAG AGAGAAGTTA AAT 

SEQ ID NO: 47 

SEQUENCE LENGTH: 26 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY : linear 

MOLECULE TYPE: synthetic DNA 

ANT I SENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA CCC TTC CGC AAC ACC 26 
- TRM Gly Glu Ala Val Gly 



SEQ ID NO: 48 
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SEQUENCE LENGTH: 28 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY : 1 inear 
MOLECULE TYPE: synthetic DNA 
ANTI SENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA AGG CAG AAC TTC CAC CG 28 
TRM Pro Leu Val Glu Val Thr 

SEQ ID NO: 49 

SEQUENCE LENGTH: 38 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

TTCTAGAA TCA TTT AAA TAG GAT GTA GTC TCC AGC TAC 
TRM Lys Phe Leu lie Tyr Asp Gly Ala Val 

SEQ ID NO: 50 

SEQUENCE LENGTH: 34 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



EP0 905 232 A1 



ANTI SENSE: Yes 
SEQUENCE DESCRIPTION 

TTCTAGAA TCA TGG TTT CAT GTA ATC TTT ATC CG 
TRM Pro Lys Met Tyr Asp Lys Asp Thr 

SEQ ID NO: 51 

SEQUENCE LENGTH: 19 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TTC ATG AGG CTC CGA AAT G 19 
Met Arg Leu Arg Asn Gly 

SEQ ID NO: 52 

SEQUENCE LENGTH: 29 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANT I SENSE: Yes 

SEQUENCE DESCRIPTION 

CAAGCT TCA GTT TGT GAC TTT TTT AAT AT 29 
TRM Asn Thr Val Lys Lys lie His 
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SEQ ID NO: 53 

SEQUENCE LENGTH: 27 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TC ATG ATA CTA AAG GAG TTA ACA AGC A 27 
Met He Leu Lys Glu Leu Thr Ser Lys 

SEQ ID NO: 54 

SEQUENCE LENGTH: 51 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANT I SENSE: Yes 

SEQUENCE DESCRIPTION 

CAAGCT TCA GTG GTG GTG GTG GTG GTG GTT TGT GAC TTT 
TRM His His His His His His Asn Thr Val Lys 

TTT AAT ATG GAT 51 
Lys He His He 

SEQ ID NO: 55 
SEQUENCE LENGTH: 18 
SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
ANTISENSE: Yes 

SEQUENCE DESCRIPTION 
CAAGCT TCA GTG GTG GTG 18 
TRM His His His 

SEQ ID NO: 56 

SEQUENCE LENGTH: 32 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 

TC ATG ATA TTA AAG GAG TTA ACA AGC AAA AAA 
Met He Leu Lys Glu Leu Thr Ser Lys Lys 

SEQ ID NO: 57 

SEQUENCE LENGTH: 51 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTISENSE: Yes 



SEQUENCE DESCRIPTION 
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CAAGCT TCA GTG GTG GTG GTG GTG GTG GTT GGT GGC TTT 3 
TRM His His His His His His Asn Thr Ala Lys 

TTT AAT ATG AAT 51 
Lys lie His He 

SEQ ID NO: 58 

SEQUENCE LENGTH: 49 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTI SENSE: Yes 

SEQUENCE DESCRIPTION 

CAAGCT TCA GTG GTG GTG GTG GTG GTG TGG TTT CAT ATA 
TRM His His His His His His Pro Lys Met Tyr 

ATC TTT ATC C 49 
Asp Lys Asp 

SEQ ID NO: 59 

SEQUENCE LENGTH: 19 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
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TTC ATG AGG CTC CGC AAT G 19 
Met Arg Leu Arg Asn Gly 

SEQ ID NO: 60 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

ANTI SENSE: Yes 

SEQUENCE DESCRIPTION 
CAGGTTGAGC TCCTTGGA 18 

SEQ ID NO: 61 

SEQUENCE LENGTH: 18 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
TCCAAGGAGC TCAACCTG 18 

SEQ ID NO: 62 
SEQUENCE LENGTH: 44 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
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MOLECULE TYPE: synthetic DNA 
ANTISENSE: Yes 

SEQUENCE DESCRIPTION 

CAAGCT TCA GTG GTG GTG GTG GTG GTG GTC GGC CTT TTT CAG GA 44 
TRM His His His His His His Asp Als Lys Lys Leu Phe 

SEQ ID NO: 63 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION 
CCC ATG GGC CGC CTA ACA GAG GA 23 
Met Gly Arg Leu Thr Glu Asp 



Claims 

1. a pi -> 4 N-acetyiglucosaminyltransferasG having an activity to produce a saccharide having a partial structure 
represented by the formula below: 

GlcNAc 01 >** P 1— 4GlcNAc- 

Man a I 



using UDP-GlcNAc as a sugar donor and a saccharide having a partial structure represented by the formula below 
as a sugar receptor: 
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^j^^ P I— ^GlcNAc- 



The p1 -> 4 N-acetylglucosaminyltransferase of claim 1 , wherein the saccharide as a sugar receptor is an oligosac- 
charide, polysaccharide, glycoconjugate (glycopeptide. glycoprotein, glycolipid or proteoglycan) or a denvative 
thereof. 

The p1 -> 4 N-acetylglucosaminyltransferase of claim 1 . wherein the sugar receptor is a saccharide having the par- 
tial structure represented by the formula below: 

RK 

^Man a l ^ 

K 2 jMan (3 l-4GlcNAc— R3 



GlcNAc/Sl— 2Manal<" 



wherein 



R1: H-. Manal -> 6 or GIcNAcpi -» 6 
R2: H-. Manal -> 3 or GJcNAcpi -» 2 
R3: -OH or p1 -» 4GJcNAc -» R4 
R4: -OH. -H. -pyridylamine or -peptide chain. 

4. A pi -» 4 N-acetylglucosarrinyltransferase conprising the amino acid sequences shown in SEQ ID NOS: 1-14 as 
partial sequences. 

5 a B1 -> 4 N-acetylglucosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 18 or the 
amino acid sequence shown in SEQ ID NO: 18 which has addition, deletion or substitution of one or more amino 
acid residues and yet which produces p1 -> 4 N-acetylglucosaminyKransferase activity. 

6 A p1 -► 4 N-acetylglucosaminyltransferase consisting of a partial sequence of the amino acid sequence shown in 
SEQ ID NO- 18 including at least from position 93 to position 383 residues, or said partial sequence which has 
addition, deletion or substitution of one or more amino acid residues and yet which produces p1 -» 4 N-acetylgJu- 
cosaminyltransferase activity. 

7 AB1 -> 4 N-acetytgluoosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 24 or the 
amino acid sequence shown in SEQ ID NO: 24 which has addition, deletion or substitution of one or more amino 
acid residues and yet which produces pi ->4 N-acetylglucosaminyttransf erase activity. 

8 Ap1 -> 4 N-acetylglucosaminyHransferase consisting of a partial sequence of the amino acid sequence shown in 
SEQ ID NO- 24 including at least from position 94 to position 383 residues, or said partial sequence which has 
addition, deletion or substitutio'n of one or more amino acid residues and yet which produces p1 -> 4 N-acetylglu- 
cosaminyttransferase activity. 

9 a p1 -* 4 N-acetylglucosaminyltransferase consisting of the amino acid sequence shown in SEQ ID NO: 37 or the 
amino acid sequence shown in SEQ ID NO: 37 which has addition, deletion or substitution of one or more amino 
acid residues and yet which produces p1 4 N-acetytglucosaminyttransferase activity. 

10 A p1 -> 4 N-acetylglucosaminyltransferase consisting of a partial sequence of the amino acid sequence shown in 
SEQ ID NO: 37 including at least from position 91 to position 390 residues, or said partial sequence which has 
addition deletion or substitution of one or more amino acid residues and yet which produces p1 -> 4 N-acetylglu- 



15 



20 
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cosaminyHransferase activity. 

1 1 . A p1 -> 4 N-acetylglucosaminyrtransf erase gene coding for the p1 4 N-acetylglucosaminyltransf erase of claim 5 
or 6. 

5 

12. A p1 -> 4 N-acetylglucosaminyltransf erase gene coding for the p1 -> 4 N-acetylglucosaminyltransf erase of claim 7 
or 8. 

13. A p1 -> 4 N-acetylglucosaminyttransferase gene coding for the pi -> 4 N-acetytglucosaminyrtransf erase of claim 9 
jo or 10. 

14. A pi -> 4 N-acetylglucosaminyltransferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 17. 

15. A p1 -» 4 N-acetylg!ucosaminy1transferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 23. 

16. A pi -> 4 N-acetylglucosaminyltransferase gene consisting of the nucleotide sequence shown in SEQ ID NO: 36. 

17. A recombinant DNA obtained by inserting into a vector DNA the p1 -> 4 N-acetylglucosarninyrtransferase gene of 
any one of claims 1 1 to 16. 

18. A chromosomal fragment comprising a part or all of the p1 -> 4 N-acetylglucosaminyltransferase gene of any one 
of claims 11 to 16. 

19. A host cell carrying the recombinant DNA of claim 17. 

20. A host cell into which the chromosomal fragment of claim 18 is artificially introduced. 

21 . A method for purifying the p1 4 N-acetylglucosaminyltransferase of any one of claims 1 to 3 from biological sam- 
ples. 

22. A method for producing a p1 -> 4 N-acetylglucosaminyltransferase comprising culturing the host cell of claim 19 in 
a medium and recovering the p1 4 N-acetylglucosaminyltransferase from the resultant culture. 

23. A method for producing a p1 4 N-acetylglucosaminyrtransferase comprising culturing the host cell of claim 20 in 
35 a medium and recovering the p1 -> 4 N-acetylglucosaminyrtransferase from the resultant culture. 

24. A method for producing a p1 -> 4 N-acetylglucosaminyltransferase comprising recovering the p1 4 N-acetylglu- 
cosaminyftransferase from the secreta, body fluid or homogenate originated from the host cell of claim 19. 

40 25. A method for producing a pi -> 4 N-acetylglucosaminyrtransferase comprising recovering the p1 4 N-acetylglu- 
cosaminyrtransferase from the secreta. body fluid or homogenate originated from the host cell of claim 20. 

26. A saccharide of which the sugar chain structure is modified with the p1 4 N-acetylglucosaminyrtransferase of any 
one of claims 1 to 3. 

45 

27. The saccharide of claim 26, which is an oligosaccharide, a glycopeptide, a glycoprotein or a derivative thereof. 

28. A method tor modifying a branching structure of a sugar chain of a glycoprotein produced by a host cell, comprising 
introducing into the host cell the pi -> 4 N-acetylglucosaminyltransferase gene of any one of claims 1 1 to 16. 



25 



30 



50 



29. A glycoprotein in which a branching structure of a sugar chain is modified by the method of daim 28. 

30. The glycoprotein of claim 29. which is erythropoietin. 



55 
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Biosynthetic Pathways of Asn-Linked Sugar Chains 
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FIG.2 

Variations of Asn-Linked Sugar Chains 

Revised from Fig.1, Makoto Takeuchi, Wako 
Purechemical Newsletter 64,18-19.1996 



a. Mannan type b. Xvlo-high-ma nnose type 




c. High mannose type d. Hybrid type 




e. Complex type 



f. Prokaryotic ceil 
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FIG.5 

High Performance Liquid Chromatogram of 
GnT-IV Reaction Product 
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FIG.6 




Elution Volume (liter) 
Q Sepharose FF Chromatography 



FIG.7 




Elution Volume (ml) 
Cu Chelate Sepharose FF Chromatography 
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